Leveraging OpenStack and Ceph for a Controlled-Access Data Cloud

07/23/2018
by   Evan F. Bollig, et al.
0

While traditional HPC has and continues to satisfy most workflows, a new generation of researchers has emerged looking for sophisticated, scalable, on-demand, and self-service control of compute infrastructure in a cloud-like environment. Many also seek safe harbors to operate on or store sensitive and/or controlled-access data in a high capacity environment. To cater to these modern users, the Minnesota Supercomputing Institute designed and deployed Stratus, a locally-hosted cloud environment powered by the OpenStack platform, and backed by Ceph storage. The subscription-based service complements existing HPC systems by satisfying the following unmet needs of our users: a) on-demand availability of compute resources, b) long-running jobs (i.e., > 30 days), c) container-based computing with Docker, and d) adequate security controls to comply with controlled-access data requirements. This document provides an in-depth look at the design of Stratus with respect to security and compliance with the NIH's controlled-access data policy. Emphasis is placed on lessons learned while integrating OpenStack and Ceph features into a so-called "walled garden", and how those technologies influenced the security design. Many features of Stratus, including tiered secure storage with the introduction of a controlled-access data "cache", fault-tolerant live-migrations, and fully integrated two-factor authentication, depend on recent OpenStack and Ceph features.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/23/2018

From Bare Metal to Virtual: Lessons Learned when a Supercomputing Institute Deploys its First Cloud

As primary provider for research computing services at the University of...
research
07/30/2021

Cloud to Ground Secured Computing: User Experiences on the Transition from Cloud-Based to Locally-Sited Hardware

The application of high-performance computing (HPC) processes, tools, an...
research
09/26/2020

Machine Learning Algorithms for Active Monitoring of High Performance Computing as a Service (HPCaaS) Cloud Environments

Cloud computing provides ubiquitous and on-demand access to vast reconfi...
research
12/23/2020

Enabling Secure and Effective Biomedical Data Sharing through Cyberinfrastructure Gateways

Dynaswap project reports on developing a coherently integrated and trust...
research
10/17/2019

A Framework for Secure Digital Administration

The efficiency and service quality in public administration can be impro...
research
01/08/2018

P-MOD: Secure Privilege-Based Multilevel Organizational Data-Sharing in Cloud Computing

Cloud computing has changed the way enterprises store, access and share ...
research
11/03/2021

Implementing a scalable and elastic computing environment based on Cloud Containers

In this article we look at the potential of cloud containers and we prov...

Please sign up or login with your details

Forgot password? Click here to reset