Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Sep 10:4:1457619.
doi: 10.3389/fbinf.2024.1457619. eCollection 2024.

A review of model evaluation metrics for machine learning in genetics and genomics

Affiliations
Review

A review of model evaluation metrics for machine learning in genetics and genomics

Catriona Miller et al. Front Bioinform. .

Abstract

Machine learning (ML) has shown great promise in genetics and genomics where large and complex datasets have the potential to provide insight into many aspects of disease risk, pathogenesis of genetic disorders, and prediction of health and wellbeing. However, with this possibility there is a responsibility to exercise caution against biases and inflation of results that can have harmful unintended impacts. Therefore, researchers must understand the metrics used to evaluate ML models which can influence the critical interpretation of results. In this review we provide an overview of ML metrics for clustering, classification, and regression and highlight the advantages and disadvantages of each. We also detail common pitfalls that occur during model evaluation. Finally, we provide examples of how researchers can assess and utilise the results of ML models, specifically from a genomics perspective.

Keywords: classification; clustering; disease prediction; genomics prediction; machine learning; metrics; regression.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Flowchart showing four categories of machine learning. This review focuses on three subcategories (classification, regression, and clustering) within the supervised and unsupervised categories.
FIGURE 2
FIGURE 2
Illustration of cluster metric calculations. Extrinsic validation methods require known clusters to compare against whilst intrinsic validation does not.
FIGURE 3
FIGURE 3
Illustration of classification metrics. (A) Confusion matrix used to calculate precision and recall. (B) the score distribution and threshold that gives the confusion matrix in (A). Every score below the dashed line is assigned to the negative class whilst scores after the dashed line are assigned to the positive class. (C) An Area Under the Receiver-Operator Curve (AUROC) graph for the given score distribution. Different chosen thresholds (dashed lines) give different ratios of FPR to TPR. (D) AUROC graphs for the three distribution patterns. Pink shows complete separation, blue is partial separation, and yellow is complete crossover.

References

    1. Ali M. (2020). PyCaret: an open source, low-code machine learning library in Python. Available at: https://www.pycaret.org.
    1. Al-Tashi Q., Saad M. B., Muneer A., Qureshi R., Mirjalili S., Sheshadri A., et al. (2023). Machine learning models for the identification of prognostic and predictive cancer biomarkers: a systematic review. Int. J. Mol. Sci. 2023, 7781. 10.3390/ijms24097781 - DOI - PMC - PubMed
    1. Alyousef A. A., Nihtyanova S., Denton C., Bosoni P., Bellazzi R., Tucker A. (2018). Nearest consensus clustering classification to identify subclasses and predict disease. J. Healthc. Inf. Res. 2, 402–422. 10.1007/s41666-018-0029-6 - DOI - PMC - PubMed
    1. Awada H., Durmaz A., Gurnari C., Kishtagari A., Meggendorfer M., Kerr C. M., et al. (2021). Machine learning integrates genomic signatures for subclassification beyond primary and secondary acute myeloid leukemia. Blood 138, 1885–1895. 10.1182/blood.2020010603 - DOI - PMC - PubMed
    1. Babichev S., Lytvynenko M. A. T., Osypenko V. (2017). “Criterial analysis of gene expression sequences to create the objective clustering inductive technology,” in 2017 IEEE 37th international conference on electronics and nanotechnology (ELNANO) (IEEE; ).

LinkOut - more resources