Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010:641:143-66.
doi: 10.1007/978-1-60761-711-2_9.

Statistical contributions to proteomic research

Affiliations

Statistical contributions to proteomic research

Jeffrey S Morris et al. Methods Mol Biol. 2010.

Abstract

Proteomic profiling has the potential to impact the diagnosis, prognosis, and treatment of various diseases. A number of different proteomic technologies are available that allow us to look at many proteins at once, and all of them yield complex data that raise significant quantitative challenges. Inadequate attention to these quantitative issues can prevent these studies from achieving their desired goals, and can even lead to invalid results. In this chapter, we describe various ways the involvement of statisticians or other quantitative scientists in the study team can contribute to the success of proteomic research, and we outline some of the key statistical principles that should guide the experimental design and analysis of such studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Discovery of clusters in data from bsa70 fraction of brain tumor samples.
Figure 2
Figure 2
Unsupervised clustering of Leukemia spectra reveals clustering driven by run date, not type of leukemia.
Figure 3
Figure 3
Plot reveals confounding between cancer status and run date in the study design, which is especially problematic given the quality control problems evident on day 3.
Figure 4
Figure 4
A simple heat map revealed peaks at integer multiples of 180.6, which is unlikely to be driven by biology, so we removed these peaks from consideration when discriminating between cancers and normals.
Figure 5
Figure 5
Heat map of all 216 samples run on the H4 chip (top), and on the WCX2 chip (bottom). The extreme difference in the ‘benign disease’ group in the spectra on the top, along with the similarity of these profiles to the WCX2 spectra, suggest a change in protocol occurred in the middle of the first experiment.
Figure 6
Figure 6
Comparisons of urine spectra in the vicinity of M/Z 3400, revealing calibration issues between the spectra in different groups.
Figure 7
Figure 7
Portion of a raw MALDI spectrum (top), along with the preprocessed version after denoising, baseline correction and normalization (bottom) using the method described in Morris, et al. (2005), with detected peaks indicated by the dots.
Figure 8
Figure 8
Average gel from a 2d gel experiment, with detected spots using the Pinnacle method indicated the symbol “x”.

Similar articles

Cited by

References

    1. Baggerly KA, Morris JS, Wang J, Gold D, Xiao LC, Coombes KR. A comprehensive approach to the analysis of matrix-assisted laser desorption/ionization time of flight proteomics spectra from serum samples. Proteomics. 2003;3:1667–1672. - PubMed
    1. Baggerly KA, Edmonson S, Morris JS, Coombes KR. High-Resolution Serum Proteomic Patterns for Ovarian Cancer Detection. Endocrine-Related Cancers. 2004;11(4):583–584. - PubMed
    1. Baggerly KA, Morris JS, Coombes KR. Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments. Bioinformatics. 2004;20:777–785. - PubMed
    1. Baggerly KA, Coombes KR, Morris JS. Are the NCI/FDA Ovarian Proteomic Data Biased? A Reply to Producers and Consumers. Cancer Informatics. 2005;1(1):9–14. - PMC - PubMed
    1. Baggerly KA, Morris JS, Edmonson S, Coombes KR. Signal in Noise: Evaluating Reported Reproducibility of Serum Proteomic Tests for Ovarian Cancer. Journal of the National Cancer Institute. 2005;97:307–309. - PubMed

Publication types

LinkOut - more resources