Skip to Main Content

Biology Research Guide for Graduate Students: Data

Data

Google Dataset Search 

Google Dataset Search lets you find datasets hosted in data repositories only. Searching here first is a good choice when looking for data that might be hosted in a repository. This covers most of the repositories listed on this page. 

re3data

Search the re3data global registry of research data repositories to find appropriate academic discipline repositories. 

Zenodo

Zenodo is a general-purpose data repository built on open-source software that accepts all forms of research output from data files to presentation files. It was developed by the European Organization for Nuclear Research (CERN), but is open to researchers from outside the EU. Data is stored in the CERN Data Center, which provides long-term preservation. 

Dryad

Dryad is a curated, general-purpose data repository built on open-source software that is intended for sharing, publishing, and preserving publicly available research data from peer-reviewed publications in the basic sciences and medicine.

Note: Dryad is a non-profit venture supported in part by data publication charges. 

Open Science Framework

Open Science Framework (OSF), provided by the Center for Open Science (COS), is a free and open-source project management tool that supports researchers in open science best practices throughout their entire project lifecycle. OSF promotes open, centralized workflows by enabling the capture of different aspects and products of research. As a flexible repository, it can store and archive research data, protocols, and materials. 

Figshare

Figshare is a general-purpose file repository that accepts all forms of research output from data files to presentation files (e.g., PowerPoint presentations). The following discusses individual submissions to figshare, but additional features are available through institutional and publisher figshare instances. 

Harvard Dataverse

Harvard Dataverse is a general-purpose data repository built on open-source software that is intended for sharing and facilitating citation of research data. It is under continuous development by Harvard Library, Harvard University Information Technology (HUIT), and the Harvard Institute for Quantitative Social Science (IQSS). Several other institutions have made use of this open-source software project to develop independent Dataverse installations around the world. 

Mendeley Data

Mendeley Data is a free and secure cloud-based communal repository where you can store your data, ensuring it is easy to share, access and cite, wherever you are. Search more than 20+ million datasets indexed from 1000s of data repositories and collect and share datasets with the research community following the FAIR data principles.