Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar;66(3):187-194.
doi: 10.3349/ymj.2023.0628.

Digital Phenotyping of Rare Endocrine Diseases Across International Data Networks and the Effect of Granularity of Original Vocabulary

Affiliations

Digital Phenotyping of Rare Endocrine Diseases Across International Data Networks and the Effect of Granularity of Original Vocabulary

Seunghyun Lee et al. Yonsei Med J. 2025 Mar.

Abstract

Purpose: Rare diseases occur in <50 per 100000 people and require lifelong management. However, essential epidemiological data on such diseases are lacking, and a consecutive monitoring system across time and regions remains to be established. Standardized digital phenotypes are required to leverage an international data network for research on rare endocrine diseases. We developed digital phenotypes for rare endocrine diseases using the observational medical outcome partnership common data model.

Materials and methods: Digital phenotypes of three rare endocrine diseases (medullary thyroid cancer, hypoparathyroidism, pheochromocytoma/paraganglioma) were validated across three databases that use different vocabularies: Severance Hospital's electronic health record from South Korea; IQVIA's United Kingdom (UK) database for general practitioners; and IQVIA's United States (US) hospital database for general hospitals. We estimated the performance of different digital phenotyping methods based on International Classification of Diseases (ICD)-10 in the UK and the US or systematized nomenclature of medicine clinical terms (SNOMED CT) in Korea.

Results: The positive predictive value of digital phenotyping was higher using SNOMED CT-based phenotyping than ICD-10-based phenotyping for all three diseases in Korea (e.g., pheochromocytoma/paraganglioma: ICD-10, 58%-62%; SNOMED CT, 89%). Estimated incidence rates by digital phenotyping were as follows: medullary thyroid cancer, 0.34-2.07 (Korea), 0.13-0.30 (US); hypoparathyroidism, 0.40-1.20 (Korea), 0.59-1.01 (US), 0.00-1.78 (UK); and pheochromocytoma/paraganglioma, 0.95-1.67 (Korea), 0.35-0.77 (US), 0.00-0.49 (UK).

Conclusion: Our findings demonstrate the feasibility of developing digital phenotyping of rare endocrine diseases and highlight the importance of implementing SNOMED CT in routine clinical practice to provide granularity for research.

Keywords: Common data model; digital phenotyping; hypoparathyroidism; medullary thyroid cancer; pheochromocytoma; rare diseases.

PubMed Disclaimer

Conflict of interest statement

Seng Chan You is the Chief Technology Officer of PHI Digital Healthcare. The other authors have no potential conflicts of interest to disclose.

Figures

Fig. 1
Fig. 1. Studyflow. In ICD-10-originated OMOP concepts-level 1, we used the operational definitions reported in previous Korean studies based on ICD-10. In ICD-10-originated OMOP concepts-level 2, we aimed to improve sensitivity or positive predictive value through expert consensus. The changes made from ICD-10-originated OMOP concepts-level 1 to ICD-10-originated OMOP concepts-level 2 were as follows: 1) for medullary thyroid cancer, we removed the exclusion criterion of “thyroglobulin test”; 2) for nonsurgical hypoparathyroidism, we added “patients taking levothyroxine” to the inclusion criteria; 3) for pheochromocytoma/paraganglioma, we revised the inclusion criterion from “measuring catecholamine at least twice during the entire period” to “measuring catecholamine at least once before and once after surgery”. In SNOMED CT-originated OMOP concepts, the vocabulary was changed from ICD-10 to SNOMED CT. The performance of these three digital phenotyping was tested using the RED–CDM database in South Korea, and subsequently applied to the IQVIA database (United Kingdom and United States). ICD-10, International Classification of Diseases-10; OMOP, observational medical outcome partnership; SNOMED CT, systematized nomenclature of medicine clinical terms; RED–CDM, rare endocrine disease–common data model.

Similar articles

Cited by

References

    1. Richter T, Nestler-Parr S, Babela R, Khan ZM, Tesoro T, Molsen E, et al. Rare disease terminology and definitions-a systematic global review: report of the ISPOR Rare Disease Special Interest Group. Value Health. 2015;18:906–914. - PubMed
    1. The Lancet Diabetes & Endocrinology. Spotlight on rare diseases. Lancet Diabetes Endocrinol. 2019;7:75. - PubMed
    1. Hartley T, Lemire G, Kernohan KD, Howley HE, Adams DR, Boycott KM. New diagnostic approaches for undiagnosed rare genetic diseases. Annu Rev Genomics Hum Genet. 2020;21:351–372. - PubMed
    1. Reincke M, Hokken-Koelega A. Perspectives of the European Society of Endocrinology (ESE) and the European Society of Paediatric Endocrinology (ESPE) on rare endocrine disease. Endocrine. 2021;71:539–541. - PMC - PubMed
    1. Marcucci G, Cianferotti L, Beck-Peccoz P, Capezzone M, Cetani F, Colao A, et al. Rare diseases in clinical endocrinology: a taxonomic classification system. J Endocrinol Invest. 2015;38:193–259. - PubMed

Supplementary concepts