An optimized variant prioritization process for rare disease diagnostics: recommendations for Exomiser and Genomiser
- PMID: 41121346
- PMCID: PMC12539062
- DOI: 10.1186/s13073-025-01546-1
An optimized variant prioritization process for rare disease diagnostics: recommendations for Exomiser and Genomiser
Abstract
Background: Exome sequencing (ES) and genome sequencing (GS) are increasingly used as standard genetic tests to identify diagnostic variants in rare disease cases. However, prioritizing these variants to reduce the time and burden of manual interpretation by clinical teams remains a significant challenge. The Exomiser/Genomiser software suite is the most widely adopted open-source software for prioritizing coding and noncoding variants. Despite its ubiquitous use, limited data-driven guidelines currently exist to optimize its performance for diagnostic variant prioritization. Based on detailed analyses of Undiagnosed Diseases Network (UDN) probands, this study presents optimized parameters and practical recommendations for deploying the Exomiser and Genomiser tools. We also highlight scenarios where diagnostic variants may be missed and propose alternative workflows to improve diagnostic success in such complex cases.
Methods: We analyzed 386 diagnosed probands from the UDN, including cases with coding and noncoding diagnostic variants. We systematically evaluated how tool performance was affected by key parameters, including gene:phenotype association data, variant pathogenicity predictors, phenotype term quality and quantity, and the inclusion and accuracy of family variant data.
Results: Parameter optimization significantly improved Exomiser's performance over default parameters. For GS data, the percentage of coding diagnostic variants ranked within the top 10 candidates increased from 49.7% to 85.5%, and for ES, from 67.3% to 88.2%. For noncoding variants prioritized with Genomiser, the top 10 rankings improved from 15.0% to 40.0%. We also explored refinement strategies for Exomiser outputs, including using p-value thresholds and flagging genes that are frequently ranked in the top 30 candidates but rarely associated with diagnoses.
Conclusion: This study provides an evidence-based framework for variant prioritization in ES and GS data using Exomiser and Genomiser. These recommendations have been implemented in the Mosaic platform to support the ongoing analysis of undiagnosed UDN participants and provide efficient, scalable reanalysis to improve diagnostic yield. Our work also highlights the importance of tracking solved cases and diagnostic variants that can be used to benchmark bioinformatics tools. Exomiser and Genomiser are available at https://github.com/exomiser/Exomiser/ .
Keywords: Diagnosis; Exome sequencing; Exomiser; Genome sequencing; Genomiser; HPO; Parameter optimization; Phenotype; Rare disease; Variant prioritization.
© 2025. The Author(s).
Conflict of interest statement
Declarations. Ethics approval and consent to participate: All work included in this study was performed in accordance with all ethical guidelines outlined in the NIH IRB no. 15HG0130 and the UDN Manual of Operations. All de-identified patient data included in this study was provided with informed consent by all participants to be used freely for research purposes across the network. The study proposal and this manuscript were approved by the UDN Publications and Research Committee. All research has been conducted in accordance with the Declaration of Helsinki. Consent for publication: Not applicable. Competing interests: A.W. and G.T.M. are co-founders and CEO and CSO, respectively, of Frameshift Labs, the developer of the Mosaic platform. The remaining authors declare no competing interests.
Figures
