Evaluation of an automated genome interpretation model for rare disease routinely used in a clinical genetic laboratory
- PMID: 36939041
- DOI: 10.1016/j.gim.2023.100830
Evaluation of an automated genome interpretation model for rare disease routinely used in a clinical genetic laboratory
Abstract
Purpose: The analysis of exome and genome sequencing data for the diagnosis of rare diseases is challenging and time-consuming. In this study, we evaluated an artificial intelligence model, based on machine learning for automating variant prioritization for diagnosing rare genetic diseases in the Baylor Genetics clinical laboratory.
Methods: The automated analysis model was developed using a supervised learning approach based on thousands of manually curated variants. The model was evaluated on 2 cohorts. The model accuracy was determined using a retrospective cohort comprising 180 randomly selected exome cases (57 singletons, 123 trios); all of which were previously diagnosed and solved through manual interpretation. Diagnostic yield with the modified workflow was estimated using a prospective "production" cohort of 334 consecutive clinical cases.
Results: The model accurately pinpointed all manually reported variants as candidates. The reported variants were ranked in top 10 candidate variants in 98.4% (121/123) of trio cases, in 93.0% (53/57) of single proband cases, and 96.7% (174/180) of all cases. The accuracy of the model was reduced in some cases because of incomplete variant calling (eg, copy number variants) or incomplete phenotypic description.
Conclusion: The automated model for case analysis assists clinical genetic laboratories in prioritizing candidate variants effectively. The use of such technology may facilitate the interpretation of genomic data for a large number of patients in the era of precision medicine.
Keywords: Clinical genomics; Machine learning methods.
Copyright © 2023 American College of Medical Genetics and Genomics. Published by Elsevier Inc. All rights reserved.
Conflict of interest statement
Conflict of Interest S.T., R.A., Y.R., N.M., and P.S.-Y. were employees and stockholders of Emedgene Technologies during the time of the study or have received stock option awards from the company. S.T., R.A., Y.R., and N.M. were employees of Illumina Inc during the time of the study or have received stock option awards from the company. The Department of Molecular and Human Genetics at Baylor College of Medicine receives revenue from clinical genetic testing conducted at Baylor Genetics Laboratories. All other authors declare no conflicts of interest.
MeSH terms
LinkOut - more resources
Full Text Sources
Medical
