Pathogenicity and functional impact of non-frameshifting insertion/deletion variation in the human genome
- PMID: 31199787
- PMCID: PMC6594643
- DOI: 10.1371/journal.pcbi.1007112
Pathogenicity and functional impact of non-frameshifting insertion/deletion variation in the human genome
Abstract
Differentiation between phenotypically neutral and disease-causing genetic variation remains an open and relevant problem. Among different types of variation, non-frameshifting insertions and deletions (indels) represent an understudied group with widespread phenotypic consequences. To address this challenge, we present a machine learning method, MutPred-Indel, that predicts pathogenicity and identifies types of functional residues impacted by non-frameshifting insertion/deletion variation. The model shows good predictive performance as well as the ability to identify impacted structural and functional residues including secondary structure, intrinsic disorder, metal and macromolecular binding, post-translational modifications, allosteric sites, and catalytic residues. We identify structural and functional mechanisms impacted preferentially by germline variation from the Human Gene Mutation Database, recurrent somatic variation from COSMIC in the context of different cancers, as well as de novo variants from families with autism spectrum disorder. Further, the distributions of pathogenicity prediction scores generated by MutPred-Indel are shown to differentiate highly recurrent from non-recurrent somatic variation. Collectively, we present a framework to facilitate the interrogation of both pathogenicity and the functional effects of non-frameshifting insertion/deletion variants. The MutPred-Indel webserver is available at http://mutpred.mutdb.org/.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures






Similar articles
-
DDIG-in: discriminating between disease-associated and neutral non-frameshifting micro-indels.Genome Biol. 2013 Mar 13;14(3):R23. doi: 10.1186/gb-2013-14-3-r23. Genome Biol. 2013. PMID: 23497682 Free PMC article.
-
Quantitative prediction of the effect of genetic variation using hidden Markov models.BMC Bioinformatics. 2014 Jan 9;15:5. doi: 10.1186/1471-2105-15-5. BMC Bioinformatics. 2014. PMID: 24405700 Free PMC article.
-
INDELpred: Improving the prediction and interpretation of indel pathogenicity within the clinical genome.HGG Adv. 2024 Oct 10;5(4):100325. doi: 10.1016/j.xhgg.2024.100325. Epub 2024 Jul 10. HGG Adv. 2024. PMID: 38993112 Free PMC article.
-
Predicting the effects of frameshifting indels.Genome Biol. 2012 Feb 9;13(2):R9. doi: 10.1186/gb-2012-13-2-r9. Genome Biol. 2012. PMID: 22322200 Free PMC article.
-
Identifying the Impact of Inframe Insertions and Deletions on Protein Function in Cancer.J Comput Biol. 2020 May;27(5):786-795. doi: 10.1089/cmb.2018.0192. Epub 2019 Aug 28. J Comput Biol. 2020. PMID: 31460787
Cited by
-
Expanding the PURA syndrome phenotype: A child with the recurrent PURA p.(Phe233del) pathogenic variant showing similarities with cutis laxa.Mol Genet Genomic Med. 2021 Jan;9(1):e1562. doi: 10.1002/mgg3.1562. Epub 2020 Dec 4. Mol Genet Genomic Med. 2021. PMID: 33275834 Free PMC article.
-
Identification of a Novel Idiopathic Epilepsy Risk Locus and a Variant in the CCDC85A Gene in the Dutch Partridge Dog.Animals (Basel). 2023 Feb 23;13(5):810. doi: 10.3390/ani13050810. Animals (Basel). 2023. PMID: 36899667 Free PMC article.
-
Evaluation of in silico pathogenicity prediction tools for the classification of small in-frame indels.BMC Med Genomics. 2023 Feb 28;16(1):36. doi: 10.1186/s12920-023-01454-6. BMC Med Genomics. 2023. PMID: 36855133 Free PMC article.
-
Genome interpretation using in silico predictors of variant impact.Hum Genet. 2022 Oct;141(10):1549-1577. doi: 10.1007/s00439-022-02457-6. Epub 2022 Apr 30. Hum Genet. 2022. PMID: 35488922 Free PMC article. Review.
-
VariBench, new variation benchmark categories and data sets.Front Bioinform. 2023 Sep 19;3:1248732. doi: 10.3389/fbinf.2023.1248732. eCollection 2023. Front Bioinform. 2023. PMID: 37795169 Free PMC article. No abstract available.
References
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources