Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 4;15(1):11544.
doi: 10.1038/s41598-025-90592-1.

Topological data analysis of pattern formation of human induced pluripotent stem cell colonies

Affiliations

Topological data analysis of pattern formation of human induced pluripotent stem cell colonies

Iryna Hartsock et al. Sci Rep. .

Abstract

Understanding the multicellular organization of stem cells is vital for determining the mechanisms that coordinate cell fate decision-making during differentiation; these mechanisms range from neighbor-to-neighbor communication to tissue-level biochemical gradients. Current methods for quantifying multicellular patterning tend to capture the spatial properties of cell colonies at a fixed scale and typically rely on human annotation. We present a computational pipeline that utilizes topological data analysis to generate quantitative, multiscale descriptors which capture the shape of data extracted from 2D multichannel microscopy images. By applying our pipeline to certain stem cell colonies, we detected subtle differences in patterning that reflect distinct spatial organization associated with loss of pluripotency. These results yield insight into putative directed cellular organization and morphogen-mediated, neighbor-to-neighbor signaling. Because of its broad applicability to immunofluorescence microscopy images, our pipeline is well-positioned to serve as a general-purpose tool for the quantitative study of multicellular pattern formation.

Keywords: Cell pattern formation; Microscopy images; Persistence landscapes; Pluripotent stem cells; Topological data analysis.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interest.

Figures

Fig. 1
Fig. 1
Our pipeline extracts topological descriptors from microscopy images of multicellular colonies. (a) An input microscopy image of cells with a nucleus signal in blue and two other signals: red (R) and green (G) (scale bar, formula image). This image is processed through the segmentation module of our pipeline to identify cell locations and associate a signal intensity to each cell. (b) A discretized version of the microscopy image in which cells are represented as points in the Euclidean plane and categorized into one of four cell types based on signal intensities by the cell type identification module. For the upper right patch, points in each cell type are shown. In general, there will be formula image cell types identified for n signals based on signal intensity. (c) The points of a single cell type (formula image) in the patch. (d) The corresponding Voronoi diagram, a partition of the patch into regions enclosing a portion of the plane that is closest to each point. (e) A step in the Delaunay filtration for the top right part of the Voronoi diagram, built by connecting neighboring points with line segments and triangles using a proximity radius r. (f) Four stages in the Delaunay filtration of the patch, an increasing sequence of simplicial complexes, with the three most persistent enclosed empty regions shown at the radius r (in pixels) at which they appear, colored in blue, green, and purple, where we have listed the colors in order of decreasing persistence. (g) The resulting persistence diagram, a topological summary that encodes the radii at which holes appear and disappear, with points corresponding to the most persistent enclosed empty regions highlighted in the same blue, green, and purple colors. (h) The corresponding persistence landscape, a decreasing sequence of piecewise-linear functions, can be combined with statistical and machine learning tools. This is the primary output of the TDA module in our pipeline. For simplicity, the formula image axis is labeled ‘Radius’ and the formula image axis is labeled ‘Persistence’. Note that the blue, green, and purple points in the persistence diagram map to points of the same color in the persistence landscape.
Fig. 2
Fig. 2
Artificial induction of exogenous GATA6-HA occurs within the context of endogenous GATA6 expression. (a) Gene circuit for chemical induction of GATA6-HA expression. The pan-GATA6 antibody can detect both induced GATA6-HA and endogenous GATA6 (i.e. the total amount of GATA6), whereas the HA antibody can only detect induced GATA6-HA (i.e. a subset produced via induction of the total amount of GATA6). (b) Representative immunofluorescence images of NANOG and pan-GATA6 or HA at 0 and 25 ng/ml Dox concentrations (scale bar, formula image). Using Volocity, we applied the gamma changes (gamma of 1.5) after brightness enhancement on all stitched large images used in the figure for better contrast for representation (see Methods). (c) Quantification of segmented images by each channel from (b). Green (NANOG) and red (pan-GATA6 or HA) fluorescent intensities are normalized to the corresponding nuclear Hoechst value in blue. The threshold for the signal in the green channel is given by the green dotted line, and the red dotted line indicates the threshold for the signal in the red channel. The four cell types based on these thresholds are given by the labels formula image, formula image, formula image, and formula image of the corresponding quadrant.
Fig. 3
Fig. 3
Comparison of average persistence landscapes across various Dox concentrations for formula image cell type in the HA group. As Dox concentration increases, the average persistence landscape gets taller and narrower.
Fig. 4
Fig. 4
We performed support vector regression on (a) persistence landscape vectors and (b) cell count vectors extracted from patches; we then averaged the Dox concentration predictions of each image’s patches. The image predictions of each Dox treatment group are spread horizontally for better visualization, and the outliers are marked with gray points. The Dox concentration predictions of 5 ng/ml, 15 ng/ml, and 25 ng/ml images are close to their actual values and well separated from one another when using persistence landscape vectors (a, top). When using cell count vectors (b, bottom) the results are broadly similar – there is no overlap between the predictions in the second and third quartiles (the boxes in the box and whisker plots) for the 5 ng/ml, 15 ng/ml, and 25 ng/ml images. However, in contrast to the persistence landscape predictions, the cell count predictions in the fourth quartile for the 5 ng/ml images overlap with those in the first quartile for 15 ng/ml images (the whiskers in the box and whisker plots) and the predictions in the fourth quartile for the 15 ng/ml images overlap with those in the first quartile for 25 ng/ml images. The Dox concentration predictions of 0 ng/ml images are not well distinguished from 5 ng/ml images in both cases.
Fig. 5
Fig. 5
Comparison between pattern formations in the pan-GATA6 and HA groups for formula image cell type with 25 ng/ml Dox concentration. (a) Average persistence landscapes of the pan-GATA6 and HA groups and their difference. The gray vertical line splits the difference plot into two parts showing distinct types of dissimilarities between the cell differentiation patterns. (b) The TDA pipeline applied to a representative patch of the pan-GATA6 group, and (c) a representative patch of the HA group. For each patch, its persistence diagram, persistence landscape, and most persistent cycles are shown. In the persistence diagrams, the gray line indicates the persistence threshold of 20 and the red line marks the threshold of 5. There are 2 points above the gray line in the pan-GATA6 patch and 3 points in the HA patch; the corresponding representative cycles of these points are shown in the rightmost column. Note that the green cycle lies within the region enclosed by the blue cycle in the HA patch.
Fig. 6
Fig. 6
The topological structures of the pan-GATA6 and HA populations are statistically different according to our results. This difference may be due to cellular organization and intracellular and/or intercellular communication in response to morphogen gradients. (a) Immunofluorescence images of co-stained pan-GATA6 and HA antibodies at 25 ng/ml Dox concentration (scale bar, formula image). The greater presence of cycles in the HA group which are more persistent suggests that cells whose differentiation is synthetically induced may be in close proximity due to chemotaxis or mitosis; we show illustrations of these mechanisms on the right (b and c). (b) We propose that morphogen-mediated signaling driven by heterogeneity in endogenous GATA6 levels could give rise to less persistent holes in the pan-GATA6 group. (c) An alternate hypothesis is that cell division alone allows HAhighNANOGlow cells to stay close to one another.

Similar articles

Cited by

References

    1. Sanderson, M. J., Smith, I., Parker, I. & Bootman, M. D. Fluorescence microscopy. Cold Spring Harb. Protoc.10, 1042–1065. 10.1101/pdb.top071795 (2014). - PMC - PubMed
    1. Ettinger, A. & Wittmann, T. Quantitative imaging in cell biology. In Fluorescence live cell imaging Vol. 123 of Methods in Cell Biology (eds Waters, J. C. & Wittman, T.) 77–94 (Academic Press, 2014). 10.1016/B978-0-12-420138-5.00005-7. - PMC - PubMed
    1. Bannon, D. et al. DeepCell kiosk: scaling deep learning-enabled cellular image analysis with Kubernetes. Nat. Methods18, 43–45 (2021). - PMC - PubMed
    1. Amitay, Y. et al. Cell Sighter: A neural network to classify cells in highly multiplexed images. Nat. Commun.10.1038/s41467-023-40066-7 (2023). - PMC - PubMed
    1. Greenwald, N. F. et al. Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning. Nat. Biotechnol.40, 555–565 (2022). - PMC - PubMed

MeSH terms

LinkOut - more resources