Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery
- PMID: 20122180
- PMCID: PMC3009481
- DOI: 10.1186/1471-2105-11-S1-S1
Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery
Abstract
Background: As a novel cancer diagnostic paradigm, mass spectroscopic serum proteomic pattern diagnostics was reported superior to the conventional serologic cancer biomarkers. However, its clinical use is not fully validated yet. An important factor to prevent this young technology to become a mainstream cancer diagnostic paradigm is that robustly identifying cancer molecular patterns from high-dimensional protein expression data is still a challenge in machine learning and oncology research. As a well-established dimension reduction technique, PCA is widely integrated in pattern recognition analysis to discover cancer molecular patterns. However, its global feature selection mechanism prevents it from capturing local features. This may lead to difficulty in achieving high-performance proteomic pattern discovery, because only features interpreting global data behavior are used to train a learning machine.
Methods: In this study, we develop a nonnegative principal component analysis algorithm and present a nonnegative principal component analysis based support vector machine algorithm with sparse coding to conduct a high-performance proteomic pattern classification. Moreover, we also propose a nonnegative principal component analysis based filter-wrapper biomarker capturing algorithm for mass spectral serum profiles.
Results: We demonstrate the superiority of the proposed algorithm by comparison with six peer algorithms on four benchmark datasets. Moreover, we illustrate that nonnegative principal component analysis can be effectively used to capture meaningful biomarkers.
Conclusion: Our analysis suggests that nonnegative principal component analysis effectively conduct local feature selection for mass spectral profiles and contribute to improving sensitivities and specificities in the following classification, and meaningful biomarker discovery.
Figures



Similar articles
-
Derivative component analysis for mass spectral serum proteomic profiles.BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1755-8794-7-S1-S5. Epub 2014 May 8. BMC Med Genomics. 2014. PMID: 25080317 Free PMC article.
-
Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.Genome Inform. 2008;21:200-11. Genome Inform. 2008. PMID: 19425159
-
Nonnegative principal component analysis for cancer molecular pattern discovery.IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):537-49. doi: 10.1109/TCBB.2009.36. IEEE/ACM Trans Comput Biol Bioinform. 2010. PMID: 20671323
-
Feature selection and machine learning with mass spectrometry data.Methods Mol Biol. 2010;593:205-29. doi: 10.1007/978-1-60327-194-3_11. Methods Mol Biol. 2010. PMID: 19957152 Review.
-
Proteomic cancer classification with mass spectrometry data.Am J Pharmacogenomics. 2005;5(5):281-92. doi: 10.2165/00129785-200505050-00001. Am J Pharmacogenomics. 2005. PMID: 16196498 Review.
Cited by
-
Molecular typing of Meningiomas by Desorption Electrospray Ionization Mass Spectrometry Imaging for Surgical Decision-Making.Int J Mass Spectrom. 2015 Feb 1;377:690-698. doi: 10.1016/j.ijms.2014.06.024. Int J Mass Spectrom. 2015. PMID: 25844057 Free PMC article.
-
Overcome support vector machine diagnosis overfitting.Cancer Inform. 2014 Dec 9;13(Suppl 1):145-58. doi: 10.4137/CIN.S13875. eCollection 2014. Cancer Inform. 2014. PMID: 25574125 Free PMC article. Review.
-
Derivative component analysis for mass spectral serum proteomic profiles.BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1755-8794-7-S1-S5. Epub 2014 May 8. BMC Med Genomics. 2014. PMID: 25080317 Free PMC article.
-
Pseudomonas species prevalence, protein analysis, and antibiotic resistance: an evolving public health challenge.AMB Express. 2022 May 9;12(1):53. doi: 10.1186/s13568-022-01390-1. AMB Express. 2022. PMID: 35532863 Free PMC article.
-
Non-negative matrix factorisation methods for the spectral decomposition of MRS data from human brain tumours.BMC Bioinformatics. 2012 Mar 8;13:38. doi: 10.1186/1471-2105-13-38. BMC Bioinformatics. 2012. PMID: 22401579 Free PMC article.
References
-
- Mantini D, Petrucci F, Del Boccio P, Pieragostino D, Di Nicola M, Lugaresi A, Federici G, Sacchetta P, Di Ilio C, Urbani A. Independent component analysis for the extraction of reliable protein signal profiles from maldi-tof mass spectra. Bioinformatics. 2008;24(1):63–70. doi: 10.1093/bioinformatics/btm533. - DOI - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials