Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Oct 5;11(10):5005-10.
doi: 10.1021/pr300567r. Epub 2012 Sep 11.

Proteomic classification of acute leukemias by alignment-based quantitation of LC-MS/MS data sets

Affiliations

Proteomic classification of acute leukemias by alignment-based quantitation of LC-MS/MS data sets

Eric J Foss et al. J Proteome Res. .

Abstract

Despite immense interest in the proteome as a source of biomarkers in cancer, mass spectrometry has yet to yield a clinically useful protein biomarker for tumor classification. To explore the potential of a particular class of mass spectrometry-based quantitation approaches, label-free alignment of liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) data sets, for the identification of biomarkers for acute leukemias, we asked whether a label-free alignment algorithm could distinguish known classes of leukemias on the basis of their proteomes. This approach to quantitation involves (1) computational alignment of MS1 peptide peaks across large numbers of samples; (2) measurement of the relative abundance of peptides across samples by integrating the area under the curve of the MS1 peaks; and (3) assignment of peptide IDs to those quantified peptide peaks on the basis of the corresponding MS2 spectra. We extracted proteins from blasts derived from four patients with acute myeloid leukemia (AML, acute leukemia of myeloid lineage) and five patients with acute lymphoid leukemia (ALL, acute leukemia of lymphoid lineage). Mobilized CD34+ cells purified from peripheral blood of six healthy donors and mononuclear cells (MNC) from the peripheral blood of two healthy donors were used as healthy controls. Proteins were analyzed by LC-MS/MS and quantified with a label-free alignment-based algorithm developed in our laboratory. Unsupervised hierarchical clustering of blinded samples separated the samples according to their known biological characteristics, with each sample group forming a discrete cluster. The four proteins best able to distinguish CD34+, AML, and ALL were all either known biomarkers or proteins whose biological functions are consistent with their ability to distinguish these classes. We conclude that alignment-based label-free quantitation of LC-MS/MS data sets can, at least in some cases, robustly distinguish known classes of leukemias, thus opening the possibility that large scale studies using such algorithms can lead to the identification of clinically useful biomarkers.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Dendrogram and heat map showing similarities between 9 patient (ALL and AML) and 8 healthy (CD34+ and MNC) samples. Levels of 639 proteins were used to cluster samples according to Euclidean distance. Visual inspection of both the dendrogram and heat map demonstrate clear clustering of the four sample types, with AML and CD34+ showing the greatest degrees of similarity, as is expected. Colors indicate Euclidean distances between pairs of samples.
Figure 2
Figure 2
A. Heat map depicting relative protein levels and dendrogram constructed on the basis of Euclidean distances between these protein levels showing the 91 proteins we identified as capable of distinguishing the 5 ALL samples from the 6 CD34+ samples. Each column represents a patient and each row represents a protein. Higher and lower protein levels are indicated in red and green, respectively. Gray boxes indicate missing values. The dendrograms on the top and left show unsupervised hierarchical clustering of patient samples and proteins, respectively. The "all or none" proteins are indicated by asterixes on the right. B. Heat map and dendrogram as in figure 2A showing the 71 proteins we identified as capable of separating 5 ALL samples from 4 AML samples. C. Heat map and dendrogram as in figure 2A showing the 17 proteins we identified as capable of separating 6 CD34+ samples from 4 AML samples.
Figure 2
Figure 2
A. Heat map depicting relative protein levels and dendrogram constructed on the basis of Euclidean distances between these protein levels showing the 91 proteins we identified as capable of distinguishing the 5 ALL samples from the 6 CD34+ samples. Each column represents a patient and each row represents a protein. Higher and lower protein levels are indicated in red and green, respectively. Gray boxes indicate missing values. The dendrograms on the top and left show unsupervised hierarchical clustering of patient samples and proteins, respectively. The "all or none" proteins are indicated by asterixes on the right. B. Heat map and dendrogram as in figure 2A showing the 71 proteins we identified as capable of separating 5 ALL samples from 4 AML samples. C. Heat map and dendrogram as in figure 2A showing the 17 proteins we identified as capable of separating 6 CD34+ samples from 4 AML samples.
Figure 2
Figure 2
A. Heat map depicting relative protein levels and dendrogram constructed on the basis of Euclidean distances between these protein levels showing the 91 proteins we identified as capable of distinguishing the 5 ALL samples from the 6 CD34+ samples. Each column represents a patient and each row represents a protein. Higher and lower protein levels are indicated in red and green, respectively. Gray boxes indicate missing values. The dendrograms on the top and left show unsupervised hierarchical clustering of patient samples and proteins, respectively. The "all or none" proteins are indicated by asterixes on the right. B. Heat map and dendrogram as in figure 2A showing the 71 proteins we identified as capable of separating 5 ALL samples from 4 AML samples. C. Heat map and dendrogram as in figure 2A showing the 17 proteins we identified as capable of separating 6 CD34+ samples from 4 AML samples.

References

    1. Roukos DH. Trastuzumab and beyond: sequencing cancer genomes and predicting molecular networks. Pharmacogenomics J. 2011 Apr;11(2):81–92. - PubMed
    1. Bacher U, Schnittger S, Haferlach C, Haferlach T. Molecular diagnostics in acute leukemias. Clin Chem Lab Med. 2009;47(11):1333–1341. - PubMed
    1. Li L, Fridley BL, Kalari K, Jenkins G, Batzler A, Weinshilboum RM, et al. Gemcitabine and arabinosylcytosin pharmacogenomics: genome-wide association and drug response biomarkers. PLoS One. 2009;4(11):e7765. - PMC - PubMed
    1. Prensner JR, Iyer MK, Balbin OA, Dhanasekaran SM, Cao Q, Brenner JC, et al. Transcriptome sequencing across a prostate cancer cohort identifies PCAT-1, an unannotated lincRNA implicated in disease progression. Nat Biotechnol. 2011 Jul 31; - PMC - PubMed
    1. Kobayashi Y, Absher DM, Gulzar ZG, Young SR, McKenney JK, Peehl DM, et al. DNA methylation profiling reveals novel biomarkers and important roles for DNA methyltransferases in prostate cancer. Genome Res. 2011 Jul;21(7):1017–1027. - PMC - PubMed

Publication types

MeSH terms