YummyData: providing high-quality open life science data
- PMID: 29688370
- PMCID: PMC5846286
- DOI: 10.1093/database/bay022
Abstract
Many life science datasets are now available via Linked Data technologies, meaning that they are represented in a common format (the Resource Description Framework), and are accessible via standard APIs (SPARQL endpoints). While this is an important step toward developing an interoperable bioinformatics data landscape, it also creates a new set of obstacles, as it is often difficult for researchers to find the datasets they need. Different providers frequently offer the same datasets, with different levels of support: as well as having more or less up-to-date data, some providers add metadata to describe the content, structures, and ontologies of the stored datasets while others do not. We currently lack a place where researchers can go to easily assess datasets from different providers in terms of metrics such as service stability or metadata richness. We also lack a space for collecting feedback and improving data providers’ awareness of user needs. To address this issue, we have developed YummyData, which consists of two components. One periodically polls a curated list of SPARQL endpoints, monitoring the states of their Linked Data implementations and content. The other presents the information measured for the endpoints and provides a forum for discussion and feedback. YummyData is designed to improve the findability and reusability of life science datasets provided as Linked Data and to foster its adoption. It is freely accessible at http://yummydata.org/. Database URL: http://yummydata.org/
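The polling component described above can be illustrated with a minimal sketch. This is not the YummyData implementation: the probe query, the function names (`build_probe_url`, `http_fetch`, `probe_endpoint`, `poll_once`), the timeout, and the recorded fields are all assumptions made for illustration. It relies only on the standard SPARQL 1.1 Protocol convention of sending a query as a `query` GET parameter and on the JSON results format, in which an `ASK` query returns a `boolean` field.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumed liveness probe; the paper does not specify which queries are sent.
ASK_QUERY = "ASK { ?s ?p ?o }"


def build_probe_url(endpoint: str, query: str = ASK_QUERY) -> str:
    """Encode a SPARQL query as a GET request URL (SPARQL 1.1 Protocol)."""
    separator = "&" if "?" in endpoint else "?"
    return endpoint + separator + urllib.parse.urlencode({"query": query})


def http_fetch(url: str, timeout: float = 10.0) -> str:
    """Fetch a URL and return its body as text, requesting SPARQL JSON results."""
    request = urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


def probe_endpoint(endpoint: str, fetch=http_fetch) -> dict:
    """Probe one endpoint once; never raises, so a polling loop keeps going."""
    started = time.monotonic()
    try:
        body = fetch(build_probe_url(endpoint))
        alive = json.loads(body).get("boolean") is True
    except Exception:  # network errors, timeouts, malformed or unexpected JSON
        alive = False
    return {
        "endpoint": endpoint,
        "alive": alive,
        "seconds": time.monotonic() - started,
    }


def poll_once(endpoints, fetch=http_fetch) -> list:
    """Probe every endpoint in a curated list and collect the results."""
    return [probe_endpoint(url, fetch) for url in endpoints]
```

Running `poll_once` on a schedule (for example from a daily job) and storing the results over time would give a rough service-stability signal per endpoint. The `fetch` parameter is injectable so the logic can be exercised without network access.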