Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec;11(12):e002170.
doi: 10.1161/CIRCGEN.118.002170.

Identification of Common and Rare Genetic Variation Associated With Plasma Protein Levels Using Whole-Exome Sequencing and Mass Spectrometry

Affiliations

Identification of Common and Rare Genetic Variation Associated With Plasma Protein Levels Using Whole-Exome Sequencing and Mass Spectrometry

Terry Solomon et al. Circ Genom Precis Med. 2018 Dec.

Abstract

Background: Identifying genetic variation associated with plasma protein levels, and the mechanisms by which they act, could provide insight into alterable processes involved in regulation of protein levels. Although protein levels can be affected by genetic variants, their estimation can also be biased by missense variants in coding exons causing technical artifacts. Integrating genome sequence genotype data with mass spectrometry-based protein level estimation could reduce bias, thereby improving detection of variation that affects RNA or protein metabolism.

Methods: Here, we integrate the blood plasma protein levels of 664 proteins from 165 participants of the Tromsø Study, measured via tandem mass tag mass spectrometry, with whole-exome sequencing data to identify common and rare genetic variation associated with peptide and protein levels (protein quantitative trait loci [pQTLs]). We additionally use literature and database searches to prioritize putative functional variants for each pQTL.

Results: We identify 109 independent associations (36 protein and 73 peptide) and use genotype data to exclude 49 (4 protein and 45 peptide) as technical artifacts. We describe 2 particular cases of rare variation: 1 associated with the complement pathway and 1 with platelet degranulation. We identify putative functional variants and show that pQTLs act through diverse molecular mechanisms that affect both RNA and protein metabolism.

Conclusions: We show that although the majority of pQTLs exert their effects by modulating RNA metabolism, many affect protein levels directly. Our work demonstrates the extent by which pQTL studies are affected by technical artifacts and highlights how prioritizing the functional variant in pQTL studies can lead to insights into the molecular steps by which a protein may be regulated.

Keywords: exons; genotype; plasma; proteomics; quantitative trait loci.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Study overview: 165 individuals from The Tromsø Study were followed from 1994–2013. Between 1994 and 1995, blood plasma and whole blood were collected; blood plasma and whole blood were processed and subsequently used for protein quantification by mass spectrometry and whole exome sequencing, respectively. These analyses identified 5,608 peptides and 664 proteins from plasma, and 3,148,863 variants from whole blood, across all individuals.
Figure 2.
Figure 2.
Description of protein and genotype data: (A) Cumulative distribution plot showing the number of peptides identified in at least N samples. 5,052 peptides were identified in at least 8 samples (blue), 3,394 peptides were identified in at least 82 samples (red), and 1,430 peptides were identified in all 165 samples (green). (B) Histogram showing the number of peptides identified for each of the 664 parent proteins. A mean of 8.45 peptides per parent protein were identified (dotted line). (C) Bar plot showing q-values from Reactome pathway analysis of the significantly enriched top level groups in the Reactome event hierarchy. The significance threshold of −log10(0.05) is shown by the red dotted line. (D) Histogram of the minor allele frequencies in this study for all 3,148,863 genetic variants identified across individuals. (E) Bar plot of the number of identified genetic variants within each SnpEff annotation. The number of variants with each annotation is also listed next to each bar.
Figure 3.
Figure 3.
Pathways identified from rare variation analyses: (A) An overview of the lectin complement pathway showing the relationship between FCN3 (Ficolin 3; teal) and the complement pathway. Nominal p-values are shown for the association between rare variation at the FCN3 locus and levels of the complement pathway proteins. C4, C3, C5, C8, and C6 were associated at a nominal P < 0.05 (purple), C2, C9, C7, or C5b were not associated (gray). (B) STRING database diagram of the five proteins associated with rare SERPINA1 variation (each labeled with their nominal association p-value). Connections between proteins are colored based on their evidence (see legend and STRING documentation).
Figure 4.
Figure 4.
Putative functional variant analyses: (A) Cartoon illustrating the genomic locations of variants with particular annotations and mechanisms, relative to the gene body of the pQTL. For example, indel annotated variants were only located within gene exons, but variants that have an underlying mechanism of “isoform” could be found in introns, exons, or the 3’ UTR. The three pQTLs where the PFV was a large genic deletion are not illustrated. (B) Stacked bar plot of the number of PFVs associated with each mechanism, subset by whether the mechanism affects the RNA molecule or the protein directly. (C) Stacked bar plot of the number of PFVs with each SnpEff annotation, subset by whether the PFVs’ mechanism affects the RNA molecule, the protein directly, or is unknown. (D) Stacked bar plot of the number of PFVs that were eQTLs in GETx, subset by whether the PFVs’ mechanism affects the RNA molecule, the protein directly, or is unknown.

Similar articles

Cited by

  • Differences and commonalities in the genetic architecture of protein quantitative trait loci in European and Arab populations.
    Thareja G, Belkadi A, Arnold M, Albagha OME, Graumann J, Schmidt F, Grallert H, Peters A, Gieger C, Consortium TQGPR, Suhre K. Thareja G, et al. Hum Mol Genet. 2023 Mar 6;32(6):907-916. doi: 10.1093/hmg/ddac243. Hum Mol Genet. 2023. PMID: 36168886 Free PMC article.
  • Genetic mechanisms of 184 neuro-related proteins in human plasma.
    Repetto L, Chen J, Yang Z, Zhai R, Timmers PRHJ, Li T, Twait EL, May-Wilson S, Muckian MD, Prins BP, Png G, Kooperberg C, Johansson Å, Hillary RF, Wheeler E, Pan L, He Y, Klasson S, Ahmad S, Peters JE, Gilly A, Karaleftheri M, Tsafantakis E, Haessler J, Gyllensten U, Harris SE, Wareham NJ, Göteson A, Lagging C, Ikram MA, van Duijn CM, Jern C, Landén M, Langenberg C, Deary IJ, Marioni RE, Enroth S, Reiner AP, Dedoussis G, Zeggini E, Butterworth AS, Mälarstig A, Wilson JF, Navarro P, Shen X. Repetto L, et al. medRxiv [Preprint]. 2023 Feb 14:2023.02.10.23285650. doi: 10.1101/2023.02.10.23285650. medRxiv. 2023. Update in: Nat Hum Behav. 2024 Nov;8(11):2222-2234. doi: 10.1038/s41562-024-01963-z. PMID: 36824751 Free PMC article. Updated. Preprint.
  • Coding and regulatory variants are associated with serum protein levels and disease.
    Emilsson V, Gudmundsdottir V, Gudjonsson A, Jonmundsson T, Jonsson BG, Karim MA, Ilkov M, Staley JR, Gudmundsson EF, Launer LJ, Lindeman JH, Morton NM, Aspelund T, Lamb JR, Jennings LL, Gudnason V. Emilsson V, et al. Nat Commun. 2022 Jan 25;13(1):481. doi: 10.1038/s41467-022-28081-6. Nat Commun. 2022. PMID: 35079000 Free PMC article.
  • The functional impact of rare variation across the regulatory cascade.
    Li T, Ferraro N, Strober BJ, Aguet F, Kasela S, Arvanitis M, Ni B, Wiel L, Hershberg E, Ardlie K, Arking DE, Beer RL, Brody J, Blackwell TW, Clish C, Gabriel S, Gerszten R, Guo X, Gupta N, Johnson WC, Lappalainen T, Lin HJ, Liu Y, Nickerson DA, Papanicolaou G, Pritchard JK, Qasba P, Shojaie A, Smith J, Sotoodehnia N, Taylor KD, Tracy RP, Van Den Berg D, Wheeler MT, Rich SS, Rotter JI, Battle A, Montgomery SB. Li T, et al. Cell Genom. 2023 Sep 6;3(10):100401. doi: 10.1016/j.xgen.2023.100401. eCollection 2023 Oct 11. Cell Genom. 2023. PMID: 37868038 Free PMC article.
  • Genetics meets proteomics: perspectives for large population-based studies.
    Suhre K, McCarthy MI, Schwenk JM. Suhre K, et al. Nat Rev Genet. 2021 Jan;22(1):19-37. doi: 10.1038/s41576-020-0268-2. Epub 2020 Aug 28. Nat Rev Genet. 2021. PMID: 32860016 Review.

References

    1. Jacobs JM, et al. Utilizing human blood plasma for proteomic biomarker discovery. J Proteome Res. 2005;4:1073–85. - PubMed
    1. Ong SE, et al. Identifying the proteins to which small-molecule probes and drugs bind in cells. Proc Natl Acad Sci U S A. 2009;106:4617–22. - PMC - PubMed
    1. Burgess S, et al. Mendelian randomization: where are we now and where are we going? Int J Epidemiol. 2015;44:379–88. - PubMed
    1. Cohen JC, et al. Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med. 2006;354:1264–72. - PubMed
    1. MacKeigan JP, et al. Proteomic profiling drug-induced apoptosis in non-small cell lung carcinoma: identification of RS/DJ-1 and RhoGDIalpha. Cancer Res. 2003;63:6928–34. - PubMed

Publication types