Functional characterization of 3D protein structures informed by human genetic diversity
- PMID: 30988206
- PMCID: PMC6500140
- DOI: 10.1073/pnas.1820813116
Functional characterization of 3D protein structures informed by human genetic diversity
Abstract
Sequence variation data of the human proteome can be used to analyze 3D protein structures to derive functional insights. We used genetic variant data from nearly 140,000 individuals to analyze 3D positional conservation in 4,715 proteins and 3,951 homology models using 860,292 missense and 465,886 synonymous variants. Sixty percent of protein structures harbor at least one intolerant 3D site as defined by significant depletion of observed over expected missense variation. Structural intolerance data correlated with deep mutational scanning functional readouts for PPARG, MAPK1/ERK2, UBE2I, SUMO1, PTEN, CALM1, CALM2, and TPK1 and with shallow mutagenesis data for 1,026 proteins. The 3D structural intolerance analysis revealed different features for ligand binding pockets and orthosteric and allosteric sites. Large-scale data on human genetic variation support a definition of functional 3D sites proteome-wide.
Keywords: deep mutational scanning; exome; genome constraint; protein structure.
Copyright © 2019 the Author(s). Published by PNAS.
Conflict of interest statement
Conflict of interest statement: M.H. is an employee of Human Longevity, Inc. J.C.V. owns stock in Human Longevity, Inc.
Figures



References
-
- Biesecker LG, Green RC. Diagnostic clinical genome and exome sequencing. N Engl J Med. 2014;371:1170. - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous