Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jun 12;28(6):1265-1269.
doi: 10.1093/jamia/ocaa343.

A high-throughput phenotyping algorithm is portable from adult to pediatric populations

Affiliations

A high-throughput phenotyping algorithm is portable from adult to pediatric populations

Alon Geva et al. J Am Med Inform Assoc. .

Abstract

Objective: Multimodal automated phenotyping (MAP) is a scalable, high-throughput phenotyping method, developed using electronic health record (EHR) data from an adult population. We tested transportability of MAP to a pediatric population.

Materials and methods: Without additional feature engineering or supervised training, we applied MAP to a pediatric population enrolled in a biobank and evaluated performance against physician-reviewed medical records. We also compared performance of MAP at the pediatric institution and the original adult institution where MAP was developed, including for 6 phenotypes validated at both institutions against physician-reviewed medical records.

Results: MAP performed equally well in the pediatric setting (average AUC 0.98) as it did at the general adult hospital system (average AUC 0.96). MAP's performance in the pediatric sample was similar across the 6 specific phenotypes also validated against gold-standard labels in the adult biobank.

Conclusions: MAP is highly transportable across diverse populations and has potential for wide-scale use.

Keywords: biobank, high-throughput; data mining; electronic health records; phenotype.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Comparison of the area under the receiver operating characteristic curve of phenotyping methods for 6 diseases investigated at both Boston Children’s Hospital PrecisionLink Biobank and the Partners Biobank. Abbreviations: CD, Crohn’s disease; HF, heart failure; ICD, International Classification of Diseases; MAP, multimodal automated phenotyping; T1DM, type 1 diabetes mellitus; UC, ulcerative colitis.

References

    1. Gutiérrez-Sacristán A, De Niz C, Kothari C, Kong SW, Mandl KD, Avillach P. GenoPheno: cataloging large-scale phenotypic and next-generation sequencing data within human datasets. Brief Bioinform 2021; 22 (1): 55–65. - PMC - PubMed
    1. Bourgeois FAvillach PKong SW, et al. . . Development of the Precision Link Biobank at Boston Children’s Hospital: Challenges and Opportunities. JPM 2017; 7 (4): 21. - PMC - PubMed
    1. Henderson GE, Cadigan RJ, Edwards TP, et al.Characterizing biobank organizations in the US: results from a national survey. Genome Med 2013; 5 (1): 3. - PMC - PubMed
    1. Vaught J, Kelly A, Hewitt R.. A review of international biobanks and networks: success factors and key benchmarks. Biopreserv Biobank 2009; 7 (3): 143–50. - PMC - PubMed
    1. Zhang Y, Cai T, Yu S, et al.High-throughput phenotyping with electronic medical record data using a common semi-supervised approach (PheCAP). Nat Protoc 2019; 14 (12): 3426–44. - PMC - PubMed

Publication types