Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 4;11(1):786.
doi: 10.15252/msb.20145728.

Quantitative variability of 342 plasma proteins in a human twin population

Affiliations

Quantitative variability of 342 plasma proteins in a human twin population

Yansheng Liu et al. Mol Syst Biol. .

Abstract

The degree and the origins of quantitative variability of most human plasma proteins are largely unknown. Because the twin study design provides a natural opportunity to estimate the relative contribution of heritability and environment to different traits in human population, we applied here the highly accurate and reproducible SWATH mass spectrometry technique to quantify 1,904 peptides defining 342 unique plasma proteins in 232 plasma samples collected longitudinally from pairs of monozygotic and dizygotic twins at intervals of 2-7 years, and proportioned the observed total quantitative variability to its root causes, genes, and environmental and longitudinal factors. The data indicate that different proteins show vastly different patterns of abundance variability among humans and that genetic control and longitudinal variation affect protein levels and biological processes to different degrees. The data further strongly suggest that the plasma concentrations of clinical biomarkers need to be calibrated against genetic and temporal factors. Moreover, we identified 13 cis-SNPs significantly influencing the level of specific plasma proteins. These results therefore have immediate implications for the effective design of blood-based biomarker studies.

Keywords: SWATH‐MS; heritability; longitudinal variability; plasma biomarkers; twin study.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Experimental design
The plasma proteomic survey of a longitudinal twin cohort was performed with SWATH-MS, an emerging mass spectrometry technique providing high quantitative accuracy and reproducibility. The observed overall variance of protein abundance was partitioned into four biological contributing factors (heritable, common environmental, individual environmental and longitudinally contributing factors) and unexplained effects using a linear mixed model.
Figure 2
Figure 2. Proteomic identification and reproducible quantification among the twin cohort
  1. A The establishment of a comprehensive, specific spectral library of human plasma proteome that is ready for SWATH-MS analysis. The reference spectra were generated a priori by extensive shotgun proteomic sequencing of fractionated plasma peptides and complemented with spectra of additional known plasma proteins (Farrah et al, 2011).

  2. B The coefficient of variance (CV) analysis at technical, whole-process experimental and cohort levels.

  3. C Heatmap of hierarchical clustering analysis of the protein vs. sample matrix indicates that plasma proteins were reproducibly quantified by SWATH-MS across the sample cohort.

  4. D The distribution of the fold changes of protein SWATH-MS intensities from their average abundance levels across the cohort is shown as box plots. Antithrombin III (ANT3) and vitamin D-binding protein (VTDB) are shown as examples of the most stable proteins, whereas apolipoprotein(a) (APOA) and serum amyloid A-1 protein (SAA1) are shown as examples of the most variable proteins. The red bars show the positions of the protein analytes whose measurement has been cleared or approved by FDA in human blood.

Figure 3
Figure 3. Dissection of the plasma protein-level variability
  1. A Histograms of contribution percentage of each biological component (red: heritability; light blue: common environment; dark blue: individual environment; yellow: longitudinal effects; gray: unexplained fraction) determined by a linear mixed model based on the longitudinal twin design. Selected protein names are shown for those clinically assayed proteins with the most heritable levels, for example, APOA, plasma protease C1 inhibitor (IC1), complement C4-A (CO4A), fibrinogen alpha chain (FIBA) and pancreatic triacylglycerol lipase (LIPP), and for the most variable proteins among the cohort, for example, SAA1, APOA, pregnancy zone protein (PZP), trifunctional enzyme subunit alpha (ECHA) and C-reactive protein (CRP).

  2. B Examples of protein abundance correlations between MZ and DZ pairs. Fibrinogen beta chain (FIBB), Serum paraoxonase/arylesterase 1 (PON1).

Figure 4
Figure 4. Biological and biomedical insights derived from twin proteomic data
  1. A Enrichment analysis of pathways and biological processes regulated by the four biological components was performed. The results were compiled into clusters according to the functional annotation of proteins.

  2. B Low correlation between plasma protein levels extracted from PeptideAtlas (www.peptideatlas.org) and their heritability contributing percentages in biological variance indicating the lack of an abundance bias. In contrast to that, concentration variability of more abundant proteins is generally less affected by longitudinal factors. The light red dashed line indicates the protein concentration of 1 μg/ml, which separates the proteins into two abundance classes.

  3. C Comparison of high-density lipoproteins (HDLs) and other proteins, using the heritability contributing percentages in biological variance of the plasma protein levels.

  4. D Comparison of heritability contributing percentages in biological variance between those proteins annotated as glycoproteins and other proteins. P-values are determined by Wilcoxon rank-sum test.

  5. E Decreasing trend of heritability control in plasma protein levels along with 5-year longitudinal process.

  6. F Clinically assayed proteins generally show lower quantitative variability compared to other plasma proteins with few exceptions, for example, CRP and APOA.

  7. G Examples of previously reported protein biomarker candidates, the plasma abundance levels of which were highly regulated by longitudinal effects. These include hexokinase-1 (HXK1), triosephosphate isomerase (TPIS), 14-3-3 protein zeta/delta (1433Z), platelet basic protein (CXCL7), monocyte differentiation antigen CD14 (CD14), biotinidase (BTD), serotransferrin (TRFE) and thyroxine-binding globulin (THBG).

Data information: P-values are determined by Wilcoxon rank-sum test.
Figure 5
Figure 5. pQTL discovery in human plasma
  1. A Manhattan plot of the best P-value per gene, highlighting the 13 statistically significant pQTL associations. The asterisks indicate that the corresponding eQTLs were found in human tissues. The cutoff of the P-value is 6.166e-3.

  2. B Examples of pQTLs: plasma protein levels among the cohort of four proteins associated with innate immune response distributed by distinct genotypes of the SNPs (see Supplementary Table S3 for all abbreviations of protein names).

References

    1. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491:56–65. - PMC - PubMed
    1. Aebersold R, Anderson L, Caprioli R, Druker B, Hartwell L, Smith R. Perspective: a program to improve protein biomarker discovery for cancer. J Proteome Res. 2005;4:1104–1109. - PubMed
    1. Almasy L, Blangero J. Multipoint quantitative-trait linkage analysis in general pedigrees. Am J Hum Genet. 1998;62:1198–1211. - PMC - PubMed
    1. Anderson GL, McIntosh M, Wu L, Barnett M, Goodman G, Thorpe JD, Bergan L, Thornquist MD, Scholler N, Kim N, O'Briant K, Drescher C, Urban N. Assessing lead time of selected ovarian cancer biomarkers: a nested case-control study. J Natl Cancer Inst. 2010;102:26–38. - PMC - PubMed
    1. Anderson NL. The clinical plasma proteome: a survey of clinical assays for proteins in plasma and serum. Clin Chem. 2010;56:177–185. - PubMed

Publication types