Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2013 Aug 27;1(2):109-127.
doi: 10.3390/proteomes1020109.

Proteomic Workflows for Biomarker Identification Using Mass Spectrometry - Technical and Statistical Considerations during Initial Discovery

Affiliations
Review

Proteomic Workflows for Biomarker Identification Using Mass Spectrometry - Technical and Statistical Considerations during Initial Discovery

Dennis J Orton et al. Proteomes. .

Abstract

Identification of biomarkers capable of differentiating between pathophysiological states of an individual is a laudable goal in the field of proteomics. Protein biomarker discovery generally employs high throughput sample characterization by mass spectrometry (MS), being capable of identifying and quantifying thousands of proteins per sample. While MS-based technologies have rapidly matured, the identification of truly informative biomarkers remains elusive, with only a handful of clinically applicable tests stemming from proteomic workflows. This underlying lack of progress is attributed in large part to erroneous experimental design, biased sample handling, as well as improper statistical analysis of the resulting data. This review will discuss in detail the importance of experimental design and provide some insight into the overall workflow required for biomarker identification experiments. Proper balance between the degree of biological vs. technical replication is required for confident biomarker identification.

Keywords: biomarker discovery; experimental design; high dimensional data; randomization; replication.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
(Top) The number of PubMed search results as a function of year; (Bottom) The growth in number of publications corresponds directly to the application of numerous technologies and methods [9,10,11,12,13,14,15,16] used to improve throughput and sensitivity.
Figure 2
Figure 2
A schematic representation of potential sources of biomarkers. Less complex model systems provide a simpler starting point for biomarker investigation; however, the clinical utility of the analysis improves by transitioning to more complex model systems.
Figure 3
Figure 3
The influence of randomization on quantitative analysis of potential biomarkers. In the figure, successive MS runs were assumed to contribute a 5% decrease in signal intensity. A randomized design allowed proper characterization of the true biomarker while avoiding improper characterization of the false biomarker. In a biased design, the samples were analyzed in an improper grouping, which led to an apparent difference in the observed concentration of the false biomarker.
Figure 4
Figure 4
Increasing the level of fractionation greatly improves the number of proteins identified, though longer analysis time is required. Conversely, biomarker experiments require analysis of larger sample sizes to improve the biological significance of the identified proteins. In the discovery phase, biomarker experimentation must find a balance between these extremes. Pooling samples is one method to reduce the analysis time, however will also limit biological relevance.

References

    1. Rosenberg S., Elashoff M.R., Beineke P., Daniels S.E., Wingrove J.A., Tingley W.G., Sager P.T., Sehnert A.J., Yau M., Kraus W.E., et al. Multicenter validation of the diagnostic accuracy of a blood-based gene expression test for assessing obstructive coronary artery disease in nondiabetic patients. Ann. Int. Med. 2010;153:425–434. doi: 10.7326/0003-4819-153-7-201010050-00005. - DOI - PMC - PubMed
    1. Dumur C.I., Lyons-Weiler M., Sciulli C., Garrett C.T., Schrijver I., Holley T.K., Rodriguez-Paris J., Pollack J.R., Zehnder J.L., Price M., et al. Interlaboratory performance of a microarray-based gene expression test to determine tissue of origin in poorly differentiated and undifferentiated cancers. J. Mol. Diagn. 2008;10:67–77. doi: 10.2353/jmoldx.2008.070099. - DOI - PMC - PubMed
    1. Cronin M., Pho M., Dutta D., Stephans J.C., Shak S., Kiefer M.C., Esteban J.M., Baker J.B. Measurement of gene expression in archival paraffin-embedded tissues: Development and performance of a 92-gene reverse transcriptase-polymerase chain reaction assay. Am. J. Pathol. 2004;164:35–42. doi: 10.1016/S0002-9440(10)63093-3. - DOI - PMC - PubMed
    1. Bedard P.L., Mook S., Piccart-Gebhard M.J., Rutgers E.T., Van’t Veer L.J., Cardoso F. MammaPrint 70-gene profile quantifies the likelihood of recurrence for early breast cancer. Expert Opin. Med. Diagn. 2009;3:193–205. doi: 10.1517/17530050902751618. - DOI - PubMed
    1. Deng M.C., Eisen H.J., Mehra M.R., Billingham M., Marboe C.C., Berry G., Kobashigawa J., Johnson F.L., Starling R.C., Murali S., et al. Noninvasive discrimination of rejection in cardiac allograft recipients using gene expression profiling. Am. J. Transplant. 2006;6:150–160. doi: 10.1111/j.1600-6143.2005.01175.x. - DOI - PubMed

LinkOut - more resources