Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2011 Jan;79(1):6-13.
doi: 10.1002/cyto.a.21007.

Rapid cell population identification in flow cytometry data

Affiliations
Comparative Study

Rapid cell population identification in flow cytometry data

Nima Aghaeepour et al. Cytometry A. 2011 Jan.

Abstract

We have developed flowMeans, a time-efficient and accurate method for automated identification of cell populations in flow cytometry (FCM) data based on K-means clustering. Unlike traditional K-means, flowMeans can identify concave cell populations by modelling a single population with multiple clusters. flowMeans uses a change point detection algorithm to determine the number of sub-populations, enabling the method to be used in high throughput FCM data analysis pipelines. Our approach compares favorably to manual analysis by human experts and current state-of-the-art automated gating algorithms. flowMeans is freely available as an open source R package through Bioconductor.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An example of finding the change point using segmented regression. The chosen solution (shown in red) consists of 6 populations.
Figure 2
Figure 2
The number of clusters selected by manual analysis and the three algorithms for the (a)GvHD and (b)DLBCL datasets.
Figure 3
Figure 3
Agreement between F-measures of flowMeans and either flowMerge(a,b) or FLAME(c,d) on GvHD(a,c) and DLBCL(b,d) datasets. The cell populations for the samples indicated with red Xs in panels (a)-(d) are shown in respective panels in Figure 4. The dashed line is the agreement line (i.e., y = x) that indicates where the performance of the two algorithms is equal. The correlation coefficient (CC) and concordance correlation coefficient (CCC) are shown as legends.
Figure 4
Figure 4
Panels (a)-(d) illustrate the cell populations found by flowMeans, flowMerge, and FLAME for the samples shown with red X’s in respective panels in Figure 3. In this figure, the >90th percentiles of each cluster are visualized to make the boundaries more robust after projection to a two dimensional scatter plot. Therefore the populations might be different from the real distributions on the margins. The pink cluster in panel (d) is a multi-modal population with 2 high-density regions. In every panel, colors of each solution are matched with the solution with the maximum number of clusters.

Comment in

References

    1. Bocsi J, Tárnok A. Toward automation of flow data analysis. Cytometry. Part A. 2008;73(8):679–680. - PubMed
    1. Lugli E, Roederer M, Cossarizza A. Good cell, bad cell: flow cytometry reveals t-cell subsets important in hiv disease. Cytometry Part A. 2010;77A(2):614–622. - PMC - PubMed
    1. Chattopadhyay P, Roederer M. Data analysis in flow cytometry: The future just started. Cytometry Part A. 2010;77A(2):705–713. - PMC - PubMed
    1. Satoh C, Dan K, Yamashita T, Jo R, Tamura H, Ogata K. Flow cytometric parameters with little interexaminer variability for diagnosing low-grade myelodysplastic syndromes. Leukemia Research. 2008;32(5):699–707. - PubMed
    1. Gratama J, Kraan J, Keeney M, Granger V, Barnett D. Reduction of variation in T-cell subset enumeration among 55 laboratories using single-platform, three or four-color flow cytometry based on CD45 and SSC-based gating of lymphocytes. Cytometry Part B: Clinical Cytometry. 2002;50(2):92–101. - PubMed

Publication types