Probabilistic disease classification of expression-dependent proteomic data from mass spectrometry of human serum
- PMID: 14980018
- DOI: 10.1089/106652703322756159
Abstract
We have developed an algorithm called Q5 for probabilistic classification of healthy versus disease whole serum samples using mass spectrometry. The algorithm applies principal components analysis (PCA) followed by linear discriminant analysis (LDA) to whole-spectrum surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) mass spectrometry (MS) data, and is demonstrated on four real datasets of complete, complex SELDI spectra of human blood serum. Q5 is a closed-form, exact solution to the problem of classifying complete mass spectra of a complex protein mixture: its probabilistic classifier is built on a dimension-reduced linear discriminant analysis. The solution is computationally efficient; it is noniterative and computes the optimal linear discriminant using closed-form equations. Replicate experiments with different training/testing splits of each dataset are used to verify the robustness of the algorithm. The method achieves sensitivity, specificity, and positive predictive values above 97% on three ovarian cancer datasets and one prostate cancer dataset. Q5 outperforms previous full-spectrum classification techniques for complex samples and can provide clues as to the molecular identities of differentially expressed proteins and peptides.
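The PCA-then-LDA pipeline described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' Q5 implementation: the function names (`fit_pca_lda`, `predict_proba`, `screening_metrics`) are hypothetical, the spectra are synthetic random data, and the posterior probability assumes equal class priors and Gaussian classes with a shared variance along the discriminant, which the abstract does not specify. It only shows how a noniterative, closed-form discriminant can be computed in a PCA-reduced space and scored with sensitivity, specificity, and positive predictive value.

```python
import numpy as np

def fit_pca_lda(X, y, n_components):
    """Fit PCA, then a two-class Fisher discriminant in the reduced space."""
    mean = X.mean(axis=0)
    # PCA via SVD of the centered data; rows of Vt are principal directions.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    P = Vt[:n_components].T                       # (n_features, n_components)
    Z = (X - mean) @ P                            # reduced training data
    mu0, mu1 = Z[y == 0].mean(axis=0), Z[y == 1].mean(axis=0)
    D0, D1 = Z[y == 0] - mu0, Z[y == 1] - mu1
    Sw = (D0.T @ D0 + D1.T @ D1) / (len(y) - 2)   # pooled within-class covariance
    # Closed-form Fisher discriminant: w = Sw^{-1} (mu1 - mu0), no iteration.
    w = np.linalg.solve(Sw, mu1 - mu0)
    return {"mean": mean, "P": P, "w": w,
            "m0": mu0 @ w, "m1": mu1 @ w, "var": w @ Sw @ w}

def predict_proba(model, X):
    """Posterior P(disease | spectrum); ASSUMES equal priors and shared variance."""
    t = (X - model["mean"]) @ model["P"] @ model["w"]   # 1-D discriminant score
    midpoint = 0.5 * (model["m0"] + model["m1"])
    logit = (model["m1"] - model["m0"]) / model["var"] * (t - midpoint)
    return 0.5 * (1.0 + np.tanh(0.5 * logit))           # numerically stable sigmoid

def screening_metrics(y_true, y_pred):
    """Sensitivity, specificity, and positive predictive value (1 = disease)."""
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    ratio = lambda a, b: a / b if b else float("nan")
    return {"sensitivity": ratio(tp, tp + fn),
            "specificity": ratio(tn, tn + fp),
            "ppv": ratio(tp, tp + fp)}

# Synthetic stand-in for spectra (NOT real SELDI data): 200 intensity bins,
# with the "disease" class elevated in the first 10 bins.
rng = np.random.default_rng(0)
n_per_class, n_features = 30, 200
X = rng.normal(size=(2 * n_per_class, n_features))
y = np.repeat([0, 1], n_per_class)
X[y == 1, :10] += 5.0

# One stratified training/testing split: 20 per class to train, 10 to test.
train = np.r_[0:20, 30:50]
test = np.r_[20:30, 50:60]

model = fit_pca_lda(X[train], y[train], n_components=5)
proba = predict_proba(model, X[test])
pred = (proba > 0.5).astype(int)
metrics = screening_metrics(y[test], pred)
print(metrics)
```

Repeating the last block over several different training/testing splits and averaging the metrics would mirror the replicate experiments the abstract mentions. The number of retained components (5 here) is an arbitrary choice for the sketch and must stay small enough that the within-class covariance remains invertible.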