Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Feb 1;17(3):483.
doi: 10.3390/cancers17030483.

Machine Learning Methods in Clinical Flow Cytometry

Affiliations
Review

Machine Learning Methods in Clinical Flow Cytometry

Nicholas C Spies et al. Cancers (Basel). .

Abstract

This review will explore the integration of machine learning (ML) techniques to enhance the analysis of increasingly complex and voluminous flow cytometry data, as traditional manual methods are insufficient for handling this data. We attempt to provide a comprehensive introduction to ML in flow cytometry, detailing the transition from manual gating to computational methods and emphasizing the importance of data quality. Key ML techniques are discussed, including supervised learning methods like logistic regression, support vector machines, and neural networks, which rely on labeled data to classify disease states. Unsupervised methods, such as k-means clustering, FlowSOM, UMAP, and t-SNE, are highlighted for their ability to identify novel cell populations without predefined labels. We also delve into newer semi-supervised and weakly supervised methods, which leverage partial labeling to improve model performance. Practical aspects of implementing ML in clinical settings are addressed, including regulatory considerations, data preprocessing, model training, validation, and the importance of generalizability, and we underscore the collaborative effort required among pathologists, data scientists, and laboratory professionals to ensure robust model development and deployment. Finally, we show the transformative potential of ML in flow cytometry in uncovering new biological insights through advanced computational techniques.

Keywords: acute leukemia; clinical flow cytometry; machine learning; operational efficiency.

PubMed Disclaimer

Conflict of interest statement

The authors have no conflicts to disclose.

Figures

Figure 1
Figure 1
A diagrammatic overview of the model development workflow for supervised machine learning, from data annotation through training and validation to production-ready inference.
Figure 2
Figure 2
Unsupervised learning algorithms. Labelled cells with unknown identities are analyzed by flow cytometry and partitioned in the feature space based on marker expression. Clustering methods are used to group data points based on their marker profiles, while dimensionality reduction can be applied to high-parameter datasets to simplify their representation and enhance the biological interpretability. These tools output cell annotations to identify distinct populations, enabling the characterization of disease states and facilitating novel discoveries.
Figure 3
Figure 3
Twin networks: one neural network model is learned on paired examples. Pairs are weakly supervised, only requiring a label for if they are the “same” or not. N.b.: this supervised signal could be developed via an unsupervised method.
Figure 4
Figure 4
AI model lifecycle from initial data analysis to model development and operationalization.

References

    1. Lansdorp P.M., Smith C., Safford M., Terstappen L.W., Thomas T.E. Single laser three color immunofluorescence staining procedures based on energy transfer between phycoerythrin and cyanine 5. Cytometry. 1991;12:723–730. doi: 10.1002/cyto.990120806. - DOI - PubMed
    1. Perfetto S.P., Chattopadhyay P.K., Roederer M. Seventeen-colour flow cytometry: Unravelling the immune system. Nat. Rev. Immunol. 2004;4:648–655. doi: 10.1038/nri1416. - DOI - PubMed
    1. Nolan J.P., Condello D. Spectral flow cytometry. Curr. Protoc. Cytom. 2013;1:1.27.1–1.27.13. doi: 10.1002/0471142956.cy0127s63. - DOI - PMC - PubMed
    1. Sahir F., Mateo J.M., Steinhoff M., Siveen K.S. Development of a 43 color panel for the characterization of conventional and unconventional T-cell subsets, B cells, NK cells, monocytes, dendritic cells, and innate lymphoid cells using spectral flow cytometry. Cytom. A. 2020;105:404–410. doi: 10.1002/cyto.a.24288. - DOI - PMC - PubMed
    1. Flores-Montero J., Sanoja-Flores L., Paiva B., Puig N., García-Sánchez O., Böttcher S., van der Velden V.H.J., Pérez-Morán J.-J., Vidriales M.-B., García-Sanz R., et al. Next Generation Flow for highly sensitive and standardized detection of minimal residual disease in multiple myeloma. Leukemia. 2017;31:2094–2103. doi: 10.1038/leu.2017.29. - DOI - PMC - PubMed

LinkOut - more resources