Common sequence variants affect molecular function more than rare variants?
- PMID: 28487536
- PMCID: PMC5431670
- DOI: 10.1038/s41598-017-01054-2
Common sequence variants affect molecular function more than rare variants?
Abstract
Any two unrelated individuals differ by about 10,000 single amino acid variants (SAVs). Do these impact molecular function? Experimental answers cannot answer comprehensively, while state-of-the-art prediction methods can. We predicted the functional impacts of SAVs within human and for variants between human and other species. Several surprising results stood out. Firstly, four methods (CADD, PolyPhen-2, SIFT, and SNAP2) agreed within 10 percentage points on the percentage of rare SAVs predicted with effect. However, they differed substantially for the common SAVs: SNAP2 predicted, on average, more effect for common than for rare SAVs. Given the large ExAC data sets sampling 60,706 individuals, the differences were extremely significant (p-value < 2.2e-16). We provided evidence that SNAP2 might be closer to reality for common SAVs than the other methods, due to its different focus in development. Secondly, we predicted significantly higher fractions of SAVs with effect between healthy individuals than between species; the difference increased for more distantly related species. The same trends were maintained for subsets of only housekeeping proteins and when moving from exomes of 1,000 to 60,000 individuals. SAVs frozen at speciation might maintain protein function, while many variants within a species might bring about crucial changes, for better or worse.
Conflict of interest statement
The authors declare that they have no competing interests.
Figures





Similar articles
-
Protein-protein and protein-nucleic acid binding residues important for common and rare sequence variants in human.BMC Bioinformatics. 2020 Oct 13;21(1):452. doi: 10.1186/s12859-020-03759-0. BMC Bioinformatics. 2020. PMID: 33050876 Free PMC article.
-
Low Diversity of Human Variation Despite Mostly Mild Functional Impact of De Novo Variants.Front Mol Biosci. 2021 Mar 18;8:635382. doi: 10.3389/fmolb.2021.635382. eCollection 2021. Front Mol Biosci. 2021. PMID: 33816556 Free PMC article.
-
Variant effect predictions capture some aspects of deep mutational scanning experiments.BMC Bioinformatics. 2020 Mar 17;21(1):107. doi: 10.1186/s12859-020-3439-4. BMC Bioinformatics. 2020. PMID: 32183714 Free PMC article.
-
The DBSAV Database: Predicting Deleteriousness of Single Amino Acid Variations in the Human Proteome.J Mol Biol. 2021 May 28;433(11):166915. doi: 10.1016/j.jmb.2021.166915. Epub 2021 Mar 4. J Mol Biol. 2021. PMID: 33676930 Free PMC article.
-
Better prediction of functional effects for sequence variants.BMC Genomics. 2015;16 Suppl 8(Suppl 8):S1. doi: 10.1186/1471-2164-16-S8-S1. Epub 2015 Jun 18. BMC Genomics. 2015. PMID: 26110438 Free PMC article.
Cited by
-
Protein-protein and protein-nucleic acid binding residues important for common and rare sequence variants in human.BMC Bioinformatics. 2020 Oct 13;21(1):452. doi: 10.1186/s12859-020-03759-0. BMC Bioinformatics. 2020. PMID: 33050876 Free PMC article.
-
Low Diversity of Human Variation Despite Mostly Mild Functional Impact of De Novo Variants.Front Mol Biosci. 2021 Mar 18;8:635382. doi: 10.3389/fmolb.2021.635382. eCollection 2021. Front Mol Biosci. 2021. PMID: 33816556 Free PMC article.
-
Assessment of methods for predicting the effects of PTEN and TPMT protein variants.Hum Mutat. 2019 Sep;40(9):1495-1506. doi: 10.1002/humu.23838. Epub 2019 Jul 3. Hum Mutat. 2019. PMID: 31184403 Free PMC article.
-
Protein embeddings and deep learning predict binding residues for various ligand classes.Sci Rep. 2021 Dec 13;11(1):23916. doi: 10.1038/s41598-021-03431-4. Sci Rep. 2021. PMID: 34903827 Free PMC article.
-
Expert-guided protein language models enable accurate and blazingly fast fitness prediction.Bioinformatics. 2024 Nov 1;40(11):btae621. doi: 10.1093/bioinformatics/btae621. Bioinformatics. 2024. PMID: 39576695 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources