The center for expanded data annotation and retrieval
- PMID: 26112029
- PMCID: PMC5009916
- DOI: 10.1093/jamia/ocv048
Abstract
The Center for Expanded Data Annotation and Retrieval is studying the creation of comprehensive and expressive metadata for biomedical datasets to facilitate data discovery, data interpretation, and data reuse. We take advantage of emerging community-based standard templates for describing different kinds of biomedical datasets, and we investigate the use of computational techniques to help investigators to assemble templates and to fill in their values. We are creating a repository of metadata from which we plan to identify metadata patterns that will drive predictive data entry when filling in metadata templates. The metadata repository not only will capture annotations specified when experimental datasets are initially created, but also will incorporate links to the published literature, including secondary analyses and possible refinements or retractions of experimental interpretations. By working initially with the Human Immunology Project Consortium and the developers of the ImmPort data repository, we are developing and evaluating an end-to-end solution to the problems of metadata authoring and management that will generalize to other data-management environments.
Keywords: biological ontologies; data collection; data curation; datasets as topic; standards.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.