[Preprint]. 2024 Sep 19:2024.09.15.613139.
doi: 10.1101/2024.09.15.613139.

CelloType: A Unified Model for Segmentation and Classification of Tissue Images

Minxing Pang et al. bioRxiv.

Abstract

Cell segmentation and classification are critical tasks in spatial omics data analysis. We introduce CelloType, an end-to-end model designed for cell segmentation and classification of biomedical microscopy images. Unlike the traditional two-stage approach of segmentation followed by classification, CelloType adopts a multi-task learning approach that connects the segmentation and classification tasks and simultaneously boosts the performance of both. CelloType leverages Transformer-based deep learning techniques for enhanced accuracy of object detection, segmentation, and classification. It outperforms existing segmentation methods when evaluated against ground truth annotations from public databases. In terms of classification, CelloType outperforms a baseline model composed of state-of-the-art methods for the individual tasks. Using multiplexed tissue images, we further demonstrate the utility of CelloType for multi-scale segmentation and classification of both cellular and non-cellular elements in a tissue. The enhanced accuracy and multi-task learning ability of CelloType facilitate automated annotation of rapidly growing spatial omics data.

Conflict of interest statement

Competing interests The authors declare no competing interests.

Figures

Figure 1 – Overview of CelloType.
a) Overall architecture, input, and output of CelloType. First, a Transformer-based feature extractor is employed to derive multi-scale features (C_b) from the image. Second, using a Transformer-based architecture, the DINO object detection module extracts latent features (C_e) and query embeddings (q_c) that are combined to generate object detection boxes with cell type labels. Subsequently, the MaskDINO module integrates the extracted image features with DINO's outputs, resulting in detailed instance segmentation and cell type classification. During training, the model is optimized with an overall loss function (Loss) that combines losses on the cell segmentation mask (λ_mask L_mask), the bounding box (λ_box L_box), and the cell type label (λ_cls L_cls). b) Input, output, and architecture of the DINO module. The DINO module consists of a multi-layer Transformer and multiple prediction heads. DINO starts by flattening the multi-scale features from the Transformer-based feature extractor. These features are merged with positional embeddings to preserve spatial context (step 1 in the figure). DINO then employs a mixed query selection strategy, initializing positional queries Q_pos as anchor detection boxes and maintaining content queries Q_content as learnable features, thus adapting to the diverse characteristics of cells (step 2). The model refines these anchor boxes through decoder layers using a deformable attention mechanism and employs contrastive denoising training, introducing noise into ground truth (GT) labels and boxes to improve robustness and accuracy. A linear projection then acts as the classification branch to produce the classification result for each box (step 3). c) Multi-scale ability of CelloType. CelloType is versatile and can perform a range of end-to-end tasks at different scales, including cell segmentation, nuclear segmentation, microanatomical structure segmentation, and full instance segmentation with corresponding class annotations.
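The Figure 1a caption describes training against a weighted sum of mask, box, and classification losses. The following is a minimal sketch of that combination in Python; the weight values and function name are illustrative placeholders, not taken from the CelloType code.

# Sketch of the overall training objective named in Figure 1a:
# Loss = λ_mask·L_mask + λ_box·L_box + λ_cls·L_cls
LAMBDA_MASK = 1.0  # placeholder weight on the instance segmentation mask loss
LAMBDA_BOX = 1.0   # placeholder weight on the bounding box regression loss
LAMBDA_CLS = 1.0   # placeholder weight on the cell type classification loss

def multitask_loss(l_mask: float, l_box: float, l_cls: float) -> float:
    """Combine the three task losses into the single training objective."""
    return LAMBDA_MASK * l_mask + LAMBDA_BOX * l_box + LAMBDA_CLS * l_cls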
Figure 2 – Evaluation of segmentation accuracy using the TissueNet datasets.
a) Average precision (AP) across Intersection over Union (IoU) thresholds for cell segmentation by Mesmer, Cellpose2, CelloType, and CelloType_C (CelloType with confidence score). The mean AP across IoU thresholds of 0.5–0.9 (mAP) for each method is indicated in parentheses. b) AP across IoU thresholds for nuclear segmentation. c) Performance of the methods stratified by imaging platform and tissue type. The top left heatmap shows the mAP scores for cell segmentation stratified by imaging platform, including CODEX, CyCIF, IMC, MIBI, MxIF, and Vectra. The top right heatmap shows the mAP scores for cell segmentation stratified by tissue type, including breast, gastrointestinal, immune, pancreas, and skin. The second row of heatmaps shows the mAP values for nuclear segmentation. d) Representative examples of cell segmentation of immune tissue imaged using the Vectra platform. Blue, nuclear channel; green, membrane channel; white, cell boundary. The red box highlights a representative region where the methods perform differently. The AP75 score (average precision at an IoU threshold of 0.75) is displayed on the images. e) Representative examples of nuclear segmentation of gastrointestinal tissue imaged using the IMC platform. The AP50 scores are shown on the images.
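Panels a and b report AP at individual IoU thresholds and the mean across thresholds of 0.5–0.9 (mAP). The following is a minimal sketch of that averaging, assuming a user-supplied average_precision(predictions, ground_truth, iou_threshold) scoring function; the function is a placeholder, not part of any specific library.

import numpy as np

def mean_average_precision(average_precision, predictions, ground_truth):
    """Average AP over IoU thresholds 0.50, 0.55, ..., 0.90 (the mAP in parentheses)."""
    thresholds = np.arange(0.5, 0.95, 0.05)  # 0.50 through 0.90 in steps of 0.05
    ap_scores = [average_precision(predictions, ground_truth, t) for t in thresholds]
    return float(np.mean(ap_scores))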
Figure 3 – Evaluation of segmentation accuracy using the Cellpose Cyto dataset.
a) Average precision (AP) across Intersection over Union (IoU) thresholds for Cellpose2, CelloType, and CelloType_C (CelloType with confidence score). The mean AP across IoU thresholds of 0.5–0.9 (mAP) for each method is indicated in parentheses. b) Mean AP values of Cellpose2, CelloType, and CelloType_C stratified by imaging modality and cell type. The test dataset comprises microscopy and non-microscopy images from the Cellpose Cyto dataset, which includes six subsets: Cells (Cell Image Library), Cells (Fluorescent), Cells (Non-fluorescent), Cells (Membrane), Other microscopy, and Non-microscopy. c) Representative examples of cell segmentation of a microscopy image by the compared methods. The red boxes highlight a representative region where the methods perform differently. The AP75 score is displayed on the images. d) Representative examples of cell segmentation of a non-fluorescent image by the compared methods.
Figure 4 – CelloType performs joint segmentation and cell type classification.
a) Bar plot showing AP50 values for cell type annotation by the two compared methods. b) Line plot showing the relationship between classification accuracy and confidence score threshold for the two methods. c) Representative examples of cell segmentation and classification results using the colorectal cancer CODEX dataset. Each row represents a 200 × 200-pixel field of view (FOV) of a CODEX image. Each FOV shows predicted cell segmentation masks (boxes) and cell types (colors). Ground Truth, manually annotated cell types; CelloType, end-to-end cell segmentation and cell type classification; Cellpose2+CellSighter, cell segmentation by Cellpose2 followed by cell type classification by CellSighter. Randomly selected confidence scores for cell classification computed by the two methods are displayed next to the predicted instances.
Figure 5 – Performance benchmarking of Cellpose2 and CellSighter.
Each method was evaluated on its originally intended task, namely Cellpose2 for segmentation and CellSighter for cell type classification. The colorectal cancer CODEX dataset was used for benchmarking. a) AP values for segmentation across a range of IoU thresholds. The mean AP (mAP) is shown in parentheses. b) Heatmap showing the confusion matrix of CellSighter cell type classification results. Ground truth cell segmentation masks were used as input to CellSighter. Each entry in the heatmap shows an accuracy score and the count of cells. c) Bar plot showing the precision scores for each class identified by the CellSighter model based on the ground truth cell segmentation masks, with an overall mean precision of 0.53.
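Panel c reports per-class precision derived from the confusion matrix in panel b. The following is a minimal sketch of that calculation, assuming the confusion matrix stores true classes in rows and predicted classes in columns; the function and variable names are illustrative, and the authors' actual evaluation code is not shown in this listing.

import numpy as np

def per_class_precision(confusion: np.ndarray) -> np.ndarray:
    """Precision per predicted class from a confusion matrix.

    confusion[i, j] counts cells whose true class is i and predicted class is j,
    so the precision for class j is the diagonal entry divided by the column sum.
    """
    correct = np.diag(confusion).astype(float)
    predicted_totals = confusion.sum(axis=0).astype(float)
    # Guard against division by zero for classes that were never predicted.
    return np.divide(correct, predicted_totals,
                     out=np.zeros_like(correct), where=predicted_totals > 0)

The overall mean precision quoted in the caption (0.53) would then correspond to averaging this vector across classes.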
Figure 6 – CelloType supports joint multi-scale segmentation and classification.
a) Performance evaluation of CelloType stratified by cell and microanatomical structure type. The bar plot shows the mean and 95% confidence interval of AP50 values from 5-fold cross-validation experiments. b) Line plot showing the relationship between classification accuracy and confidence score threshold. c) Representative examples of multi-scale segmentation and classification using human bone marrow CODEX data. The first row of images shows an example of a bone marrow area consisting of various types of smaller hematopoietic cells and much larger adipocytes. The second row shows an example of a bone marrow area consisting of various hematopoietic cell types and microanatomical structures such as trabecular bone fragments. Randomly selected confidence scores for cell classification are displayed next to the predicted instances.
