Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Sep 15;106(37):15544-8.
doi: 10.1073/pnas.0904100106. Epub 2009 Aug 26.

Protein quantification across hundreds of experimental conditions

Affiliations

Protein quantification across hundreds of experimental conditions

Zia Khan et al. Proc Natl Acad Sci U S A. .

Abstract

Quantitative studies of protein abundance rarely span more than a small number of experimental conditions and replicates. In contrast, quantitative studies of transcript abundance often span hundreds of experimental conditions and replicates. This situation exists, in part, because extracting quantitative data from large proteomics datasets is significantly more difficult than reading quantitative data from a gene expression microarray. To address this problem, we introduce two algorithmic advances in the processing of quantitative proteomics data. First, we use space-partitioning data structures to handle the large size of these datasets. Second, we introduce techniques that combine graph-theoretic algorithms with space-partitioning data structures to collect relative protein abundance data across hundreds of experimental conditions and replicates. We validate these algorithmic techniques by analyzing several datasets and computing both internal and external measures of quantification accuracy. We demonstrate the scalability of these techniques by applying them to a large dataset that comprises a total of 472 experimental conditions and replicates.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest statement: A patent for related technology has been filed with the U.S. Patent and Trademark office by Princeton University. Z.K., M.S., and L.K. are coinventors.

Figures

Fig. 1.
Fig. 1.
A representative LC-MS/MS scan. A scan consists of millions of peaks, each with a specific retention time, m/z, and intensity. The intensity of a peak is indicated in grayscale (darker gray corresponds to higher intensity). The first Inset shows several extracted ion chromatograms (XICs) corresponding to the naturally occurring isotopes of a tryptic peptide. The area under an XIC provides a measure of the relative abundance of a peptide. In this example, one peak in the XIC has an MS/MS fragmentation spectrum from which the peptide's amino acid sequence and, hence, the identity of the protein from which it derives can be determined.
Fig. 2.
Fig. 2.
Comparison of known protein concentration with relative abundance measured by mass spectrometry. Measured protein abundance is plotted on a log–log scale against known femtomole concentration for each of six nonhuman proteins spiked in to human serum by Mueller et al. The lines show best fit by regression, and the corresponding log–log scale slopes and correlation values (R2) for both the log–log scale and the original scale are shown below each plot.

References

    1. Domon B, Aebersold R. Mass spectrometry and protein analysis. Science. 2006;312:212–217. - PubMed
    1. Aebersold R, Mann M. Mass spectrometry-based proteomics. Nature. 2003;422:198–207. - PubMed
    1. Cox J, Mann M. Is proteomics the new genomics? Cell. 2007;130:395–398. - PubMed
    1. Ong S, Mann M. Mass spectrometry-based proteomics turns quantitative. Nat Chem Biol. 2005;1:252–262. - PubMed
    1. Geer LY, et al. Open mass spectrometry search algorithm. J Proteome Res. 2004;3:958–964. - PubMed

Publication types

LinkOut - more resources