Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Aug 1;195(3):773-9.
doi: 10.4049/jimmunol.1500633.

Algorithmic Tools for Mining High-Dimensional Cytometry Data

Affiliations
Review

Algorithmic Tools for Mining High-Dimensional Cytometry Data

Cariad Chester et al. J Immunol. .

Abstract

The advent of mass cytometry has led to an unprecedented increase in the number of analytes measured in individual cells, thereby increasing the complexity and information content of cytometric data. Although this technology is ideally suited to the detailed examination of the immune system, the applicability of the different methods for analyzing such complex data is less clear. Conventional data analysis by manual gating of cells in biaxial dot plots is often subjective, time consuming, and neglectful of much of the information contained in a highly dimensional cytometric dataset. Algorithmic data mining has the promise to eliminate these concerns, and several such tools have been applied recently to mass cytometry data. We review computational data mining tools that have been used to analyze mass cytometry data, outline their differences, and comment on their strengths and limitations. This review will help immunologists to identify suitable algorithmic tools for their particular projects.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Principal component analysis of a 3-dimensional point cloud
The principal components are ordered by their capacity to explain the variance in the data.
Figure 2
Figure 2. Swiss roll
The artificial Swiss roll data provides an example of how linear dimensionality reduction can be misleading. Points A and B are proximate when considering their linear relationship in Euclidean space. However, when the entire structure of the data manifold is considered, the points are distant.
Figure 3
Figure 3. Hierarchical, agglomerative clustering
In agglomerative clustering, each point in multidimensional space is initially its own cluster. Then, the most phenotypically similar “clusters” merge into an agglomerative, parent cluster. This procedure then iterates until the target number of clusters is reached. Both SPADE and Citrus utilize agglomerative clustering.
Figure 4
Figure 4. Self-Organizing Map with Start Charts
A self-organizing map (SOM) is a type of artificial neural network used for clustering. In SOM-based clustering, each cell is assigned to a phenotypically similar node in the network. Characteristics of each node can be visualized via star charts. Star charts provide a way to visualize the mean intensities of the markers for all cells in the dataset assigned to a specific node. The height of each wedge within the star chart indicates the markers intensity (i.e. if the wedge reaches the border of the circle, the cells have a high expression of that marker).

References

    1. Maecker HT, McCoy JP, Nussenblatt R. Standardizing immunophenotyping for the Human Immunology Project. Nat Rev Immunol. 2012;12:191–200. - PMC - PubMed
    1. Lugli E, Roederer M, Cossarizza A. Data analysis in flow cytometry: the future just started. Cytometry A. 2010;77:705–713. - PMC - PubMed
    1. Ornatsky OI, Kinach R, Bandura DR, Lou X, Tanner SD, Baranov VI, Nitz M, Winnik MA. Development of analytical methods for multiplex bio-assay with inductively coupled plasma mass spectrometry. J Anal At Spectrom. 2008;23:463–469. - PMC - PubMed
    1. Bendall SC, Nolan GP, Roederer M, Chattopadhyay PK. A deep profiler's guide to cytometry. Trends Immunol. 2012;33:323–332. - PMC - PubMed
    1. Aghaeepour N, Finak G, Hoos H, Mosmann TR, Brinkman R, Gottardo R, Scheuermann RH. Critical assessment of automated flow cytometry data analysis techniques. Nat Methods. 2013;10:228–238. - PMC - PubMed

Publication types

Substances