The Common Data Elements for cancer research: remarks on functions and structure
- PMID: 17149500
- PMCID: PMC2980785
The Common Data Elements for cancer research: remarks on functions and structure
Abstract
Objectives: The National Cancer Institute (NCI) has developed the Common Data Elements (CDE) to serve as a controlled vocabulary of data descriptors for cancer research, to facilitate data interchange and inter-operability between cancer research centers. We evaluated CDE's structure to see whether it could represent the elements necessary to support its intended purpose, and whether it could prevent errors and inconsistencies from being accidentally introduced. We also performed automated checks for certain types of content errors that provided a rough measure of curation quality.
Methods: Evaluation was performed on CDE content downloaded via the NCI's CDE Browser, and transformed into relational database form. Evaluation was performed under three categories: 1) compatibility with the ISO/IEC 11179 metadata model, on which CDE structure is based, 2) features necessary for controlled vocabulary support, and 3) support for a stated NCI goal, set up of data collection forms for cancer research.
Results: Various limitations were identified both with respect to content (inconsistency, insufficient definition of elements, redundancy) as well as structure--particularly the need for term and relationship support, as well as the need for metadata supporting the explicit representation of electronic forms that utilize sets of common data elements.
Conclusions: While there are numerous positive aspects to the CDE effort, there is considerable opportunity for improvement. Our recommendations include review of existing content by diverse experts in the cancer community; integration with the NCI thesaurus to take advantage of the latter's links to nationally used controlled vocabularies, and various schema enhancements required for electronic form support.
Figures


Similar articles
-
Quality evaluation of value sets from cancer study common data elements using the UMLS semantic groups.J Am Med Inform Assoc. 2012 Jun;19(e1):e129-36. doi: 10.1136/amiajnl-2011-000739. Epub 2012 Apr 17. J Am Med Inform Assoc. 2012. PMID: 22511016 Free PMC article.
-
Achieving interoperability for metadata registries using comparative object modeling.Stud Health Technol Inform. 2010;160(Pt 2):1136-9. Stud Health Technol Inform. 2010. PMID: 20841861
-
Mapping clinical phenotype data elements to standardized metadata repositories and controlled terminologies: the eMERGE Network experience.J Am Med Inform Assoc. 2011 Jul-Aug;18(4):376-86. doi: 10.1136/amiajnl-2010-000061. Epub 2011 May 19. J Am Med Inform Assoc. 2011. PMID: 21597104 Free PMC article.
-
[caCORE: core architecture of bioinformation on cancer research in America].Beijing Da Xue Xue Bao Yi Xue Ban. 2006 Apr 18;38(2):218-21. Beijing Da Xue Xue Bao Yi Xue Ban. 2006. PMID: 16617371 Review. Chinese.
-
Common Data Elements for Unruptured Intracranial Aneurysm and Subarachnoid Hemorrhage Clinical Research: Recommendations from the Working Group on Long-Term Therapies.Neurocrit Care. 2019 Jun;30(Suppl 1):79-86. doi: 10.1007/s12028-019-00727-2. Neurocrit Care. 2019. PMID: 31077078
Cited by
-
ODMSummary: A Tool for Automatic Structured Comparison of Multiple Medical Forms Based on Semantic Annotation with the Unified Medical Language System.PLoS One. 2016 Oct 13;11(10):e0164569. doi: 10.1371/journal.pone.0164569. eCollection 2016. PLoS One. 2016. PMID: 27736972 Free PMC article.
-
Automated Tools for Clinical Research Data Quality Control using NCI Common Data Elements.AMIA Jt Summits Transl Sci Proc. 2014 Apr 7;2014:60-9. eCollection 2014. AMIA Jt Summits Transl Sci Proc. 2014. PMID: 25717402 Free PMC article.
-
An exploratory study using an openEHR 2-level modeling approach to represent common data elements.J Am Med Inform Assoc. 2016 Sep;23(5):956-67. doi: 10.1093/jamia/ocv137. Epub 2016 Jan 23. J Am Med Inform Assoc. 2016. PMID: 26911823 Free PMC article.
-
Pragmatic MDR: a metadata repository with bottom-up standardization of medical metadata through reuse.BMC Med Inform Decis Mak. 2021 May 17;21(1):160. doi: 10.1186/s12911-021-01524-8. BMC Med Inform Decis Mak. 2021. PMID: 34001121 Free PMC article.
-
QL4MDR: a GraphQL query language for ISO 11179-based metadata repositories.BMC Med Inform Decis Mak. 2019 Mar 18;19(1):45. doi: 10.1186/s12911-019-0794-z. BMC Med Inform Decis Mak. 2019. PMID: 30885183 Free PMC article.
References
-
- National Cancer Institute. Cancer Bioinformatics Grid. 2004. [Last accessed: 11/25/04]. http://cabig.nci.nih.gov.
-
- Marco D. Building and Managing the Metadata Repository. New York: Wiley; 2000.
-
- National Library of Medicine. Medical Subject Headings- Home Page. 2004. [Last accessed: 11/25/04]. www.nlm.nih.gov/mesh/meshhome.html.
-
- Regenstrief Institute. LOINC home page. 2002. [Last accessed: 7/8/02]. http://www.regenstrief.org/loinc/
-
- College of American Pathologists. SNOMED Clinical Terms (SNOMED CT) 2002. [Last accessed: 10/2/02]. www.snomed.org.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources