Two-stage model-based clustering for liquid chromatography mass spectrometry data analysis
- PMID: 19222382
- DOI: 10.2202/1544-6115.1308
Two-stage model-based clustering for liquid chromatography mass spectrometry data analysis
Abstract
Proteomic mass spectrometry is gaining an increasing role in diagnostics and in studies on protein complexes and biological systems. This experimental technology is producing high-throughput data which is inherently noisy and may contain various errors. Mathematical processing can help in removing them.In this paper we focus on the peak alignment problem in LC-MS spectra. As an alternative to heuristic approaches to the problem, we propose a mathematically sound method which exploits a model-based clustering. In this framework experiment errors are modeled as deviations from real values and mass spectra are regarded as finite Gaussian mixtures. The advantage of such an approach is that it provides convenient techniques for adjusting parameters and selecting solutions of best quality. The method can be parameterized by assuming various constraints. In this paper we investigate and compare different classes of models. We analyze the results in terms of statistically significant biomarkers that can be identified after the alignment of spectra. The study was conducted on a dataset of plasma samples of colorectal cancer patients and healthy donors.
Similar articles
-
New algorithms for processing and peak detection in liquid chromatography/mass spectrometry data.Rapid Commun Mass Spectrom. 2002;16(5):462-7. doi: 10.1002/rcm.600. Rapid Commun Mass Spectrom. 2002. PMID: 11857732
-
Normalization regarding non-random missing values in high-throughput mass spectrometry data.Pac Symp Biocomput. 2006:315-26. Pac Symp Biocomput. 2006. PMID: 17094249
-
A geometric approach for the alignment of liquid chromatography-mass spectrometry data.Bioinformatics. 2007 Jul 1;23(13):i273-81. doi: 10.1093/bioinformatics/btm209. Bioinformatics. 2007. PMID: 17646306
-
Technical, bioinformatical and statistical aspects of liquid chromatography-mass spectrometry (LC-MS) and capillary electrophoresis-mass spectrometry (CE-MS) based clinical proteomics: a critical assessment.J Chromatogr B Analyt Technol Biomed Life Sci. 2009 May 1;877(13):1250-8. doi: 10.1016/j.jchromb.2008.10.048. Epub 2008 Nov 6. J Chromatogr B Analyt Technol Biomed Life Sci. 2009. PMID: 19010091 Review.
-
Alignment of LC-MS images, with applications to biomarker discovery and protein identification.Proteomics. 2008 Feb;8(4):650-72. doi: 10.1002/pmic.200700791. Proteomics. 2008. PMID: 18297649 Review.
Cited by
-
Bioinformatics and computational biology in Poland.PLoS Comput Biol. 2013;9(5):e1003048. doi: 10.1371/journal.pcbi.1003048. Epub 2013 May 2. PLoS Comput Biol. 2013. PMID: 23658507 Free PMC article. No abstract available.
-
Image analysis tools and emerging algorithms for expression proteomics.Proteomics. 2010 Dec;10(23):4226-57. doi: 10.1002/pmic.200900635. Proteomics. 2010. PMID: 21046614 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials