Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May;43(5):539-546.
doi: 10.1002/humu.24362. Epub 2022 Mar 9.

An expanded phenotype centric benchmark of variant prioritisation tools

Affiliations

An expanded phenotype centric benchmark of variant prioritisation tools

Denise Anderson et al. Hum Mutat. 2022 May.

Abstract

Identifying the causal variant for diagnosis of genetic diseases is challenging when using next-generation sequencing approaches and variant prioritization tools can assist in this task. These tools provide in silico predictions of variant pathogenicity, however they are agnostic to the disease under study. We previously performed a disease-specific benchmark of 24 such tools to assess how they perform in different disease contexts. We found that the tools themselves show large differences in performance, but more importantly that the best tools for variant prioritization are dependent on the disease phenotypes being considered. Here we expand the assessment to 37 tools and refine our assessment by separating performance for nonsynonymous single nucleotide variants (nsSNVs) and missense variants (i.e., excluding nonsense variants). We found differences in performance for missense variants compared to nsSNVs and recommend three tools that stand out in terms of their performance (BayesDel, CADD, and ClinPred).

Keywords: dbNSFP; disease; human phenotype ontology; phenotype; variant prioritization.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Heatmaps showing performance (auPRC) of variant prioritization tools for missense variants (a) and nsSNVs (b). Color coding of columns is based on the method used to predict pathogenicity, where black = conservation scores, yellow = machine learning scores and red = ensemble scores. Hierarchical cluster analysis with Euclidean distance and complete agglomeration was used to cluster both the tools and the HPO terms. HPO, Human Phenotype Ontology
Figure 2
Figure 2
Boxplots showing auPRC of the top performing variant prioritization tools across selected top level HPO phenotypic abnormality terms and all their descendant terms for missense variants and nsSNVs. Abnormality of metabolism/homeostasis includes 340 terms for missense variants and 443 for nsSNVs. Abnormality of the immune system includes 242 terms for missense variants and 288 for nsSNVs. Abnormality of the nervous system includes 806 terms for missense variants and 898 for nsSNVs. Neoplasm includes 227 terms for missense variants and 291 for nsSNVs. auPRC, area under the precision‐recall curve; nsSNV, nonsynonymous single nucleotide variant

References

    1. Adzhubei, I. A. , Schmidt, S. , Peshkin, L. , Ramensky, V. E. , Gerasimova, A. , Bork, P. , Kondrashov, A. S. , & Sunyaev, S. R. (2010). A method and server for predicting damaging missense mutations. Nature Methods, 7(4), 248–249. 10.1038/nmeth0410-248 - DOI - PMC - PubMed
    1. Alirezaie, N. , Kernohan, K. D. , Hartley, T. , Majewski, J. , & Hocking, T. D. (2018). ClinPred: Prediction tool to identify disease‐relevant nonsynonymous single‐nucleotide variants. American Journal of Human Genetics, 103(4), 474–483. 10.1016/j.ajhg.2018.08.005 - DOI - PMC - PubMed
    1. Anderson, D. , & Lassmann, T. (2018). A phenotype centric benchmark of variant prioritisation tools. NPJ Genomic Medicine, 3, 5. 10.1038/s41525-018-0044-9 - DOI - PMC - PubMed
    1. Carter, H. , Douville, C. , Stenson, P. D. , Cooper, D. N. , & Karchin, R. (2013). Identifying Mendelian disease genes with the variant effect scoring tool. BMC Genomics, 14(Suppl 3), S3. 10.1186/1471-2164-14-S3-S3 - DOI - PMC - PubMed
    1. Caspar, S. M. , Dubacher, N. , Kopps, A. M. , Meienberg, J. , Henggeler, C. , & Matyas, G. (2018). Clinical sequencing: From raw data to diagnosis with lifetime value. Clinical Genetics, 93(3), 508–519. 10.1111/cge.13190 - DOI - PubMed

Publication types