Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jul 20;10(1):470.
doi: 10.1038/s41597-023-02258-0.

The Translational Data Catalog - discoverable biomedical datasets

Affiliations

The Translational Data Catalog - discoverable biomedical datasets

Danielle Welter et al. Sci Data. .

Abstract

The discoverability of datasets resulting from the diverse range of translational and biomedical projects remains sporadic. It is especially difficult for datasets emerging from pre-competitive projects, often due to the legal constraints of data-sharing agreements, and the different priorities of the private and public sectors. The Translational Data Catalog is a single discovery point for the projects and datasets produced by a number of major research programmes funded by the European Commission. Funded by and rooted in a number of these European private-public partnership projects, the Data Catalog is built on FAIR-enabling community standards, and its mission is to ensure that datasets are findable and accessible by machines. Here we present its creation, content, value and adoption, as well as the next steps for sustainability within the ELIXIR ecosystem.

PubMed Disclaimer

Conflict of interest statement

SAS is Honorary Academic Editor of Scientific Data and PRS is a member of the Scientific Data Senior Editorial Board.

Figures

Fig. 1
Fig. 1
The Data Catalog Data Model. The model links the core data entities - Project, Study and Dataset - via directional relationships. Each core entity contains a set of relevant properties.
Fig. 2
Fig. 2
An example of a FAIRplus Evaluation result. The FAIRplus Evaluation panel lists the evaluation method used and the evaluation results, as well as links to the full assessment and the dataset (if publicly available).
Fig. 3
Fig. 3
Architecture diagram for the ELIXIR Luxembourg data ecosystem.
Fig. 4
Fig. 4
Illustration of the new Data Catalog user interface. This figure shows the dataset page for one of the Roche Immunomics datasets, part of the imSAVAR project (https://www.imi.europa.eu/projects-results/project-factsheets/imsavar), including the linking panels for the Project and Study pages, the general information panel with various metadata values including Experiment Types and Sample Types, and the Data Use Restrictions panel. The page also highlights the FAIRplus Evaluated badge.
Fig. 5
Fig. 5
An example of the page headers showing the Bioschemas integration for the OncoTrack dataset. The header snippet shows the instantiation of the Bioschemas Dataset profile for the ONCOTRACK (http://www.imi.europa.eu/projects-results/project-factsheets/onco-track) dataset, found at https://datacatalog.elixir-luxembourg.org/e/dataset/64f33e4f-0d6d-4062-86c5-9c3db4e3a99a.

References

    1. Wilkinson MD, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 2016;3:160018. doi: 10.1038/sdata.2016.18. - DOI - PMC - PubMed
    1. European Commission. Directorate General for Research and Innovation. & PwC EU Services. Cost-benefit analysis for FAIR research data: cost of not having FAIR research data. (Publications Office, 2018).
    1. Sansone S-A, et al. DATS, the data tag suite to enable discoverability of datasets. Sci. Data. 2017;4:170059. doi: 10.1038/sdata.2017.59. - DOI - PMC - PubMed
    1. Ohno-Machado L, et al. Finding useful data across multiple biomedical data repositories using DataMed. Nat. Genet. 2017;49:816–819. doi: 10.1038/ng.3864. - DOI - PMC - PubMed
    1. Ohno-Machado L, 2015. bioCADDIE white paper - Data Discovery Index. Figshare. - DOI