Biomarker discovery and redundancy reduction towards classification using a multi-factorial MALDI-TOF MS T2DM mouse model dataset
- PMID: 21554713
- PMCID: PMC3116487
- DOI: 10.1186/1471-2105-12-140
Biomarker discovery and redundancy reduction towards classification using a multi-factorial MALDI-TOF MS T2DM mouse model dataset
Abstract
Background: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics.
Results: We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods.
Conclusions: The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author.
Figures






Similar articles
-
Pancreatic cancer biomarkers discovery by surface-enhanced laser desorption and ionization time-of-flight mass spectrometry.Clin Chem Lab Med. 2009;47(6):713-23. doi: 10.1515/CCLM.2009.158. Clin Chem Lab Med. 2009. PMID: 19426140
-
A new strategy for faster urinary biomarkers identification by Nano-LC-MALDI-TOF/TOF mass spectrometry.BMC Genomics. 2008 Nov 14;9:541. doi: 10.1186/1471-2164-9-541. BMC Genomics. 2008. PMID: 19014585 Free PMC article.
-
Detection and identification of a protein biomarker in antibiotic-resistant Escherichia coli using intact protein LC offline MALDI-MS and MS/MS.J Appl Microbiol. 2020 Mar;128(3):697-709. doi: 10.1111/jam.14507. Epub 2019 Dec 9. J Appl Microbiol. 2020. PMID: 31715076 Free PMC article.
-
Reproducibility in protein profiling by MALDI-TOF mass spectrometry.Clin Chem. 2007 May;53(5):852-8. doi: 10.1373/clinchem.2006.082644. Epub 2007 Mar 29. Clin Chem. 2007. PMID: 17395711 Review.
-
MALDI-TOF MS as evolving cancer diagnostic tool: a review.J Pharm Biomed Anal. 2014 Jul;95:245-55. doi: 10.1016/j.jpba.2014.03.007. Epub 2014 Mar 15. J Pharm Biomed Anal. 2014. PMID: 24699369 Review.
Cited by
-
Recent applications of chemometrics in one- and two-dimensional chromatography.J Sep Sci. 2020 May;43(9-10):1678-1727. doi: 10.1002/jssc.202000011. Epub 2020 Mar 19. J Sep Sci. 2020. PMID: 32096604 Free PMC article. Review.
-
Informed baseline subtraction of proteomic mass spectrometry data aided by a novel sliding window algorithm.Proteome Sci. 2016 Dec 7;14:19. doi: 10.1186/s12953-016-0107-8. eCollection 2016. Proteome Sci. 2016. PMID: 27980460 Free PMC article.
-
MicroRNA profiling of dogs with transitional cell carcinoma of the bladder using blood and urine samples.BMC Vet Res. 2017 Nov 15;13(1):339. doi: 10.1186/s12917-017-1259-1. BMC Vet Res. 2017. PMID: 29141625 Free PMC article.
-
Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins.BMC Res Notes. 2013 Sep 8;6:358. doi: 10.1186/1756-0500-6-358. BMC Res Notes. 2013. PMID: 24010718 Free PMC article.
References
-
- Tiffin N, Adie E, Turner F, Brunner HG, van Driel MA, Oti M, Lopez-Bigas N, Ouzounis C, Perez-Iratxeta C, Andrade-Navarro MA, Adeyemo A, Patti ME, Semple CA, Hide W. Computational disease gene identification: a concert of methods prioritizes type 2 diabetes and obesity candidate genes. Nucleic Acids Res. 2006;34:3067–3081. doi: 10.1093/nar/gkl381. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical