Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan 8;47(D1):D1172-D1178.
doi: 10.1093/nar/gky1061.

BioSamples database: an updated sample metadata hub

Affiliations

BioSamples database: an updated sample metadata hub

Mélanie Courtot et al. Nucleic Acids Res. .

Abstract

The BioSamples database at EMBL-EBI provides a central hub for sample metadata storage and linkage to other EMBL-EBI resources. BioSamples has recently undergone major changes, both in terms of data content and supporting infrastructure. The data content has more than doubled from around 2 million samples in 2014 to just over 5 million samples in 2018. Fast, reciprocal data exchange was fully established between sister Biosample databases and other INSDC partners, enabling a worldwide common representation and centralization of sample metadata. The BioSamples platform has been upgraded to accommodate anticipated increases in the number of submissions via GA4GH driver projects such as the Human Cell Atlas and the EGA, as well as from mirroring of NCBI dbGaP data. The BioSamples database is now the authoritative repository for all INSDC sample metadata, an ELIXIR Deposition Database for Biomolecular Data and the EMBL-EBI sample metadata hub. To support faster turnaround for sample submission, and to increase scalability and resilience, we have upgraded the BioSamples database backend storage, APIs and user interface. Finally, the website has been redesigned to allow search and retrieval of records based on specific filters, such as 'disease' or 'organism'. These changes are targeted at answering current use cases as well as providing functionalities for future emerging and anticipated developments. Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A search query for ‘Homo sapiens’ samples with ‘disease state’ associated metadata and links to external database visualized in the new BioSamples user interface. https://www.ebi.ac.uk/biosamples/samples?filter=attr:Organism:Homo+sapiens&filter=attr:disease+state&filter=attr:external+reference.
Figure 2.
Figure 2.
An example of user-supplied JSON block representing AMR data. On the left hand side the JSON file and on the right side the rendering in the BioSamples UI. The block boxes contain the attribute name/value pairs, the orange boxes frame the AMR data Some attributes omitted from for clarity.
Figure 3.
Figure 3.
Users can submit and search the BioSamples database via APIs. The BioSamples services perform required operations including data persistence using a MongoDB, indexing via Apache Solr, and curation using external EMBL-EBI services OLS and Zooma. BioSamples are exchanged with INSDC partners and exported in multiple formats. The application also provides links to EMBl-EBI archives (blue circle are established links, grey circles are under development).

References

    1. Kolesnikov N., Hastings E., Keays M., Melnichuk O., Tang Y.A., Williams E., Dylag M., Kurbatova N., Brandizi M., Burdett T. et al. ArrayExpress update–simplifying data submissions. Nucleic Acids Res. 2015; 43:D1113–D1116. - PMC - PubMed
    1. Silvester N., Alako B., Amid C., Cerdeño-Tarrága A., Clarke L., Cleland I., Harrison P.W., Jayathilaka S., Kay S., Keane T. et al. The European Nucleotide Archive in 2017. Nucleic Acids Res. 2018; 46:D36–D40. - PMC - PubMed
    1. Lappalainen I., Almeida-King J., Kumanduri V., Senf A., Spalding J.D., Ur-Rehman S., Saunders G., Kandasamy J., Caccamo M., Leinonen R. et al. The European Genome-phenome Archive of human data consented for biomedical research. Nat. Genet. 2015; 47:692–695. - PMC - PubMed
    1. Faulconbridge A., Burdett T., Brandizi M., Gostev M., Pereira R., Vasant D., Sarkans U., Brazma A., Parkinson H.. Updates to BioSamples database at European Bioinformatics Institute. Nucleic Acids Res. 2014; 42:D50–D52. - PMC - PubMed
    1. Barrett T., Clark K., Gevorgyan R., Gorelenkov V., Gribov E., Karsch-Mizrachi I., Kimelman M., Pruitt K.D., Resenchuk S., Tatusova T. et al. BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata. Nucleic Acids Res. 2012; 40:D57–D63. - PMC - PubMed

Publication types

MeSH terms