A Petabyte-scale Scientific Community Cloud

The OSDC enables scientific researchers to easily manage, share, and analyze large datasets.

What is the OSDC? >> Watch a Video

OSDC in brief

The Open Science Data Cloud provides the scientific community with resources for storing, sharing, and analyzing terabyte and petabyte-scale scientific datasets. The OSDC is a data science ecosystem in which researchers can house and share their own scientific data, access complementary public datasets, build and share customized virtual machines with whatever tools necessary to analyze their data, and perform the analysis to answer their research questions. It is a one-stop shop for making scientific research faster and easier.

Why is there a need?

With datasets growing larger and larger, researchers are finding that the bottleneck to discovery is no longer a lack of data but an inability to manage, analyze, and share their large datasets. Individual researchers can no longer download and analyze the important datasets in their scientific fields on their own computers. The goal of the Open Science Data Cloud is to remove the bottleneck to discovery by providing researchers with access to a variety of key datasets across scientific disciplines and the computing infrastructure to allow scientists to easily manage and share their data and analysis.>> read more

Featured on the OSDC

Bionimbus Bionimbus
Project Matsu Project Matsu
Tukey Tukey
What is the OSDC? What is the OSDC?



OCC Featured in Campus Technology

The Open Cloud Consortium is featured in an article in Campus Technology that showcases our efforts to provide the research community with the tools they need for discovery. The full piece by David Raths can be found here. "We started before the current interest at NSF and other funding agencies in big data and data science," said [Dr. Robert] Grossman, who is a professor in the division of biological sciences at the University of Chicago (IL). "There just wasn't an interest in data-intensive science or big data or supporting data repositories at scale." The OCC's foundational approach to Data Commons: ... more ...

OSDC PIRE Fellowship Deadline Extended

The application deadline for OSDC Partnerships for International Research and Education (PIRE) fellowships has been extended to March 31st. Interested graduate students, senior undergrads, postdocs, or early career faculty actively conducting research in a field involving computing or a data intensive science should apply now. The goal of OSDC PIRE is to provide training that makes it easier for scientists to use cloud computing for their data intensive researh. OSDC PIRE fellowships are fully-funded summer fellowships to work abroad and gain hands-on experience with large datasets, data intensive projects, cloud computing, and other large scale computing environments. OSDC PIRE workshops bring together a diverse group of researchers to think about the challenges and solutions that are facing the data intensitve research community. The workshops provide opportunities to create practical applications that will improve the communities' research efficacy. Says former PIRE Fellow and lead software developer for the OSDC Tukey Web Portal, Matt Greenway: “The idea to leverage Horizon came from an OSDC PIRE workshop in Edinburgh. Horizon is a nice interface that a lot of work has gone into and we want Tukey to be able to use all of Horizon’s new features while adding features to Tukey that are unrelated to OpenStack.” ... more ...

OCC Announced as a Matter Member

Today we're pleased to announce that the OCC has joined the Matter community. OCC staff will have a dedicated workspace in the Matter ecosystem in Chicago's historic Merchandise Mart. "Matter, which opened Monday, aims to grow companies through mentorship, networking opportunities, partnerships and affordable workspace. Early this week, the incubator announced a partnership with the American Medical Association, which will include an Interaction Studio where entrepreneurs and physicians will collaborate and test new technologies, services and products." Learn more about Matter here. ... more ...

OSDC Milestone - Over 700 Resource Allocations Granted

Today we're proud to announce another Open Science Data Cloud milestone. Over 700 resource allocations have been granted to researchers to use the OSDC and the list of publications that utilized the OSDC continues to grow. As demand for resources to store, share, and analyze terabyte and petabyte scale datasets continues to grow, so too does the OSDC ecosystem. Individuals granted OSDC resource allocations can house and share their own scientific data, access datasets in our Public Data Commons directly mounted to their virtual machines, build and share customized virtual machines with tools for data analysis, and then perform the analysis to answer their research questions. The OSDC is a one-stop shop for making scientific research faster and easier. ... more ...

Genomics Data Commons @ University of Chicago Announced

The University of Chicago announced today their collaboration with the National Cancer Institute to establish the Genomic Data Commons. The Genomic Data Commons project will help researchers around the country assess genetic information from more than 10,000 cancer patients, which could be used to develop more effective treatments, said Robert Grossman, a professor of medicine at University of Chicago who is directing the project. From the Chicago Tribune ... more ...

How can I get involved?


Access the Public Data Commons

The OSDC has 1 PB of publicly accessible data in a wide variety of disciplines. Interested researchers can freely access and download these data to their own machines or apply for resources to compute over the data within the cloud.

Contribute to OSDC


All of the software developed as part of the OSDC is open source and hosted on GitHub. You can directly help the scientific cloud computing community by contributing to the open source OSDC software stack.

Apply for Compute and Storage

Fill out a short proposal for an OSDC resource allocation. Allocations start at 16 dedicated cores and 1TB of storage, but scale depending on the project needs and level of organizational partnership.


Partner with us and add your own racks to the OSDC (we will manage them for you). Organizations can also join the Open Cloud Consortium (OCC) which is made up of working groups, including the OSDC.

Contact Us

Questions? Comments? Suggestions? Contact us at info@opencloudconsortium.org.