Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Jan 17:9:1070394.
doi: 10.3389/fmolb.2022.1070394. eCollection 2022.

KODAMA exploratory analysis in metabolic phenotyping

Affiliations
Review

KODAMA exploratory analysis in metabolic phenotyping

Maria Mgella Zinga et al. Front Mol Biosci. .

Erratum in

Abstract

KODAMA is a valuable tool in metabolomics research to perform exploratory analysis. The advanced analytical technologies commonly used for metabolic phenotyping, mass spectrometry, and nuclear magnetic resonance spectroscopy push out a bunch of high-dimensional data. These complex datasets necessitate tailored statistical analysis able to highlight potentially interesting patterns from a noisy background. Hence, the visualization of metabolomics data for exploratory analysis revolves around dimensionality reduction. KODAMA excels at revealing local structures in high-dimensional data, such as metabolomics data. KODAMA has a high capacity to detect different underlying relationships in experimental datasets and correlate extracted features with accompanying metadata. Here, we describe the main application of KODAMA exploratory analysis in metabolomics research.

Keywords: KODAMA; clustering; metabolomics; semi-supervised; unsupervised.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
KODAMA on machine learning algorithm’s map. The machine learning algorithm can be categorized in unsupervised (i.e., clustering and dimensionality reduction method) and supervised learning. KODAMA is one of unsupervised learning methods used for dimensionality reduction. Optionally, if supervised information is used to lead the process of discovery of new patterns, KODAMA can be classified as semi-supervised.
FIGURE 2
FIGURE 2
KODAMA accuracy maximization by iterative cross-validations. (A) Cross-validation model (CV) generates predicted labels (PLs) that are used to calculate the accuracy value (AC). (B) Generation of new labels to conduct the process of accuracy maximization can be i) an unsupervised method, randomly swapping some class labels of misleading samples with predicted labels; ii) semi-supervised type-I, changing only predefined labels and maintaining assigned class labels; or iii) semi-supervised type-II, changing groups of labels together forcing their belonging to the same class. (C) Generation of new labels is an iterative process aimed to identify the labels with the highest cross-validated accuracy. (D) Accuracy values increase with the number of iterations. (E) KODAMA dissimilarity matrix generated as output can be transformed with MDS, or t-SNE, in a low-dimensional space.

References

    1. Al Bataineh M. T., Soares N. C., Semreen M. H., Cacciatore S., Dash N. R., Hamad M., et al. (2021). Candida albicans PPG1, a serine/threonine phosphatase, plays a vital role in central carbon metabolisms under filament-inducing conditions: A multi-omics approach. Plos one 16 (12), e0259588. 10.1371/journal.pone.0259588 - DOI - PMC - PubMed
    1. Bataineh M. T. A., Cacciatore S., Semreen M. H., Soares N. C., Zhu X., Dash N. R., et al. (2022). Exploring the effect of estrogen on Candida albicans hyphal cell wall glycans and ergosterol synthesis. Front. Cell. Infect. Microbiol. 12, 977157. 10.3389/fcimb.2022.977157 - DOI - PMC - PubMed
    1. Becht E., McInnes L., Healy J., Dutertre C.-A., Kwok I. W., Ng L. G., et al. (2019). Dimensionality reduction for visualizing single-cell data using UMAP. Nat. Biotechnol. 37 (1), 38–44. 10.1038/nbt.4314 - DOI - PubMed
    1. Bender A., Brown N. (2018). Cheminformatics in drug discovery. Wiley Online Library. - PubMed
    1. Berry M. W., Mohamed A., Yap B. W. (2019). Supervised and unsupervised learning for data science. Springer.

LinkOut - more resources