Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method
- PMID: 12973725
- DOI: 10.1002/pmic.200300515
Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method
Abstract
There are many data mining techniques for processing and general learning of multivariate data. However, we believe the wavelet transformation and latent variable projection method are particularly useful for spectroscopic and chromatographic data. Projection based methods are designed to handle hugely multivariate nature of such data effectively. For the actual analysis of the data we have used latent variable projection methods such as principal component analysis (PCA) and partial least squares projection to latent structures based discriminant analysis (PLS-DA) to analyze the raw data presented to the participants of the First Duke Proteomics Data Mining Conference. PCA was used to solve problem #1 (clustering problem) and the PLS-DA was used to solve problem #2 (classification problem). The idea of internal and external cross-validation was used to validate the model obtained from the classification analysis. The simple two-component PLS-DA model obtained from the analysis performed well. The model has completely separated the two groups from all the data. The same model applied on two-thirds of the data showed good performance by external validation with independent test set of remaining 13 specimens obtained by setting aside the spectra of every third specimen (accuracy of 85%).
Similar articles
-
Multiple approaches to data-mining of proteomic data based on statistical and pattern classification methods.Proteomics. 2003 Sep;3(9):1704-9. doi: 10.1002/pmic.200300512. Proteomics. 2003. PMID: 12973729
-
Feature selection and nearest centroid classification for protein mass spectrometry.BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68. BMC Bioinformatics. 2005. PMID: 15788095 Free PMC article.
-
Discriminant models for high-throughput proteomics mass spectrometer data.Proteomics. 2003 Sep;3(9):1699-703. doi: 10.1002/pmic.200300518. Proteomics. 2003. PMID: 12973728
-
Application of High Resolution Mass Spectrometric methods coupled with chemometric techniques in olive oil authenticity studies - A review.Anal Chim Acta. 2020 Oct 16;1134:150-173. doi: 10.1016/j.aca.2020.07.029. Epub 2020 Jul 30. Anal Chim Acta. 2020. PMID: 33059861 Review.
-
Functional genomics and proteomics in the clinical neurosciences: data mining and bioinformatics.Prog Brain Res. 2006;158:83-108. doi: 10.1016/S0079-6123(06)58004-5. Prog Brain Res. 2006. PMID: 17027692 Review.
Cited by
-
Comparative Metabonomic Investigations of Schistosoma japonicum From SCID Mice and BALB/c Mice: Clues to Developmental Abnormality of Schistosome in the Immunodeficient Host.Front Microbiol. 2019 Mar 12;10:440. doi: 10.3389/fmicb.2019.00440. eCollection 2019. Front Microbiol. 2019. PMID: 30915055 Free PMC article.
-
Primary Osteocyte Supernatants Metabolomic Profiling of Two Transgenic Mice With Connexin43 Dominant Negative Mutants.Front Endocrinol (Lausanne). 2021 May 18;12:649994. doi: 10.3389/fendo.2021.649994. eCollection 2021. Front Endocrinol (Lausanne). 2021. PMID: 34093433 Free PMC article.
-
A comparison of methods for classifying clinical samples based on proteomics data: a case study for statistical and machine learning approaches.PLoS One. 2011;6(9):e24973. doi: 10.1371/journal.pone.0024973. Epub 2011 Sep 28. PLoS One. 2011. PMID: 21969867 Free PMC article.
-
Sulfadiazine Sodium Ameliorates the Metabolomic Perturbation in Mice Infected with Toxoplasma gondii.Antimicrob Agents Chemother. 2019 Sep 23;63(10):e00312-19. doi: 10.1128/AAC.00312-19. Print 2019 Oct. Antimicrob Agents Chemother. 2019. PMID: 31383652 Free PMC article.
-
Gene features selection for three-class disease classification via multiple orthogonal partial least square discriminant analysis and S-plot using microarray data.PLoS One. 2013 Dec 30;8(12):e84253. doi: 10.1371/journal.pone.0084253. eCollection 2013. PLoS One. 2013. PMID: 24386356 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources