SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
- PMID: 37792497
- PMCID: PMC10568371
- DOI: 10.1093/bioinformatics/btad593
SPA-STOCSY: an automated tool for identifying annotated and non-annotated metabolites in high-throughput NMR spectra
Abstract
Motivation: Nuclear magnetic resonance spectroscopy (NMR) is widely used to analyze metabolites in biological samples, but the analysis requires specific expertise, it is time-consuming, and can be inaccurate. Here, we present a powerful automate tool, SPatial clustering Algorithm-Statistical TOtal Correlation SpectroscopY (SPA-STOCSY), which overcomes challenges faced when analyzing NMR data and identifies metabolites in a sample with high accuracy.
Results: As a data-driven method, SPA-STOCSY estimates all parameters from the input dataset. It first investigates the covariance pattern among datapoints and then calculates the optimal threshold with which to cluster datapoints belonging to the same structural unit, i.e. the metabolite. Generated clusters are then automatically linked to a metabolite library to identify candidates. To assess SPA-STOCSY's efficiency and accuracy, we applied it to synthesized spectra and spectra acquired on Drosophila melanogaster tissue and human embryonic stem cells. In the synthesized spectra, SPA outperformed Statistical Recoupling of Variables (SRV), an existing method for clustering spectral peaks, by capturing a higher percentage of the signal regions and the close-to-zero noise regions. In the biological data, SPA-STOCSY performed comparably to the operator-based Chenomx analysis while avoiding operator bias, and it required <7 min of total computation time. Overall, SPA-STOCSY is a fast, accurate, and unbiased tool for untargeted analysis of metabolites in the NMR spectra. It may thus accelerate the use of NMR for scientific discoveries, medical diagnostics, and patient-specific decision making.
Availability and implementation: The codes of SPA-STOCSY are available at https://github.com/LiuzLab/SPA-STOCSY.
© The Author(s) 2023. Published by Oxford University Press.
Conflict of interest statement
None declared.
Figures
Update of
-
SPA-STOCSY: An Automated Tool for Identification of Annotated and Non-Annotated Metabolites in High-Throughput NMR Spectra.bioRxiv [Preprint]. 2023 Feb 22:2023.02.22.529564. doi: 10.1101/2023.02.22.529564. bioRxiv. 2023. Update in: Bioinformatics. 2023 Oct 3;39(10):btad593. doi: 10.1093/bioinformatics/btad593. PMID: 36865102 Free PMC article. Updated. Preprint.
References
-
- Alonso A, Rodríguez MA, Vinaixa M et al. Focus: a robust workflow for one-dimensional NMR spectral analysis. Anal Chem 2014;86:1160–9. - PubMed
-
- Alves AC, Rantalainen M, Holmes E et al. Analytic properties of statistical total correlation spectroscopy based information recovery in 1H NMR metabolic data sets. Anal Chem 2009;81:2075–84. - PubMed
-
- Blaise BJ, Shintu L, Elena B et al. Statistical recoupling prior to significance testing in nuclear magnetic resonance based metabonomics. Anal Chem 2009;81:6242–51. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Molecular Biology Databases
Research Materials
