OSiRIS is a pilot project funded by the NSF to evaluate a software-defined storage infrastructure for our primary Michigan research universities. OSiRIS will combine a number of innovative concepts to provide a distributed, multi-institutional storage infrastructure that will allow researchers at any of our three campuses to read, write, manage and share their data directly from their computing facility locations.
Our goal is to provide transparent, high-performance access to the same storage infrastructure from well-connected locations on any of our campuses. We intend to enable this via a combination of network discovery, monitoring and management tools and through the creative use of CEPH features.
By providing a single data infrastructure that supports computational access on the data “in-place”, we can meet many of the data-intensive and collaboration challenges faced by our research communities and enable these communities to easily undertake research collaborations beyond the border of their own Universities.
The International Conference for High Performance Computing, Networking, Storage, and Analysis
November 11–16, 2018
Kay Bailey Hutchison Convention Center, Dallas, Texas
Members of the OSiRIS team traveled to SC18 to setup in the University of Michigan and Michigan State University combined booth!
We setup up a rack of equipment designated for use by OSiRIS, SLATE, and AGLT2 demos at SC. The rack was shipped as a unit from Michigan and was waiting for us to plug it in and set up when we got to the conference.
At Supercomputing 2018 the OSiRIS project configured a Ceph storage node with 60 OSD to host cache tier pools. This gave us approximately 600TB of raw caching space, or 200TB with 3x replicated pools. For purposes of testing we replicated pools on just a single host but in a production setup a more resilient set of OSD would be desirable.
For more information please see our full article covering the configuration and performance of the Van Andel site.
Building on existing collaboration between MSU and the Grand Rapids based Van Andel Institute, OSiRIS has deployed NVMe-based Ceph OSD nodes and an NFS gateway at the institute to enable direct access to bioinformatics research data. OSIRIS at VAI will enable VAI bioinformaticians to work with MSU researchers to better understand Parkinson’s disease and cance. OSiRIS facilitates data access for VAI researchers to leverage the computational resources at MSU Institute for Cyber Enabled Research.
The OSiRIS site at Van Andel is deployed and managed similar to other OSiRIS sites. The 3 nodes there are part of the multi-institutional OSiRIS cluster and OSD are partitioned into a separate Ceph Crush tree to be used in rules defining cache tier pools.
My Do, University of Michigan
The Ceph-mgr plugin, influx, was written to simplify the process of retrieving cluster performance counters and to visualizing them on a dashboard. This influx plugin will periodically retrieve the performance counter and send data points to influxdb where it can be queried by a dashboard such as Grafana.
OSiRIS recently finished a 6-month engagement with the Center for Trustworthy Scientific Cyberinfrastructure which is featured on the CTSC Blog. The 2016-2017 CTSC-OSiRIS collaborative design review of OSiRIS Access Assertions produced a set of security recommendations documented in this report that the OSiRIS project plans to implement in its deployed cyberinfrastructure. CTSC identified no significant weaknesses in its review of the initial design of the OSiRIS access control system.
From our perspective, the CTSC staff was helpful in ensuring that we had a well planned and secure design for OAA. The engagment process was extremely valuable and we extend our thanks to CTSC!