Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2026 Jan 7;27(1):bbag066.
doi: 10.1093/bib/bbag066.

Systematic evaluation of computational methods for cell segmentation

Affiliations

Systematic evaluation of computational methods for cell segmentation

Rongrong Yang et al. Brief Bioinform. .

Abstract

Cell segmentation plays a crucial role in elucidating cell structure and function, understanding disease mechanisms, and aiding pathological diagnosis. Current surveys primarily categorize methods by their technical evolution stages, which may not fully capture the paradigm shift brought by deep learning. Moreover, their evaluation scope is largely confined to image-only approaches, overlooking the significant potential of multimodal data in enhancing cell/nucleus segmentation performance. Therefore, we propose a dual-dimensional classification framework for deep learning methods. It categorizes such methods into two types: task-oriented (e.g. semantic or instance segmentation) and data-oriented (e.g. single or multimodal inputs). Based on this, we systematically classify and summarize methods across various segmentation tasks and imaging modalities. We also develop a benchmark test that covers both single-modal and multimodal methods. This test uses five diverse datasets, among which four are from conventional microscopy and one integrates sequencing with image data. Furthermore, it assesses seven algorithms based on three dimensions: effectiveness, robustness, and efficiency. Key findings indicate that deep learning models generally outperform traditional algorithms, with their advantage becoming more pronounced when image data is integrated with sequencing information.

Keywords: cell segmentation; deep learning; image processing; nuclei segmentation; spatial transcriptome.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Quantitative comparison of six nuclei segmentation algorithms Cellpose, DeepCell, Mesmer, Splinedist, StarDist and watershed evaluated on four image-only datasets BBBC041Seg, BSST265, MoNuSeg and PanNuKe. (A–D) present the results for precision, recall, F1-score, and dice, respectively. Each boxplot illustrates the distribution of metric values for a given algorithm and dataset, while each point represents the metric value for one image of that dataset.
Figure 2
Figure 2
Evaluates the stability of six nuclei segmentation algorithms across four image-only datasets, which include BBBC041Seg, BSST265, MoNuSeg, and PanNuKe, by measuring the standard deviation of their performance metrics. (A–D) present the standard deviations for precision, recall, F1-score, and dice, respectively. Lower values indicate greater algorithmic stability on the corresponding dataset and metric.
Figure 3
Figure 3
Evaluates the stability of six nuclei segmentation algorithms across four image-only datasets, which include BBBC041Seg, BSST265, MoNuSeg, and PanNuKe, using confidence intervals of performance metrics. (A–D) present the confidence interval for precision, recall, F1-score, and dice, respectively. Narrower confidence intervals indicate better algorithmic performance for the corresponding dataset and metric.
Figure 4
Figure 4
Results of nuclei segmentation methods on image-only datasets. The first column displays the original images, and the second column shows the corresponding segmentation masks. Columns three through the last illustrate the segmentation results obtained using different nuclei segmentation methods. The red rectangle highlights region of interest, enabling detailed evaluation of each algorithm’s performance in nuclei boundary segmentation accuracy and instance segmentation.
Figure 5
Figure 5
Performance evaluation of seven nuclei segmentation methods on the multi-modal dataset MSBDS integrating image and sequencing data. (A) Shows the segmentation effectiveness. (B) Presents the stability measured by standard deviation. (C) Displays the confidence intervals across all evaluation metrics.
Figure 6
Figure 6
Results of nuclei segmentation methods on the dataset MSBDS integrating image and sequencing data. The first two columns display original images and corresponding ground truth masks, while subsequent columns present results from different segmentation algorithms. A red rectangle highlights a region of interest in the first row to facilitate detailed comparison of boundary accuracy and instance segmentation performance.
Figure 7
Figure 7
Performance visualization results of seven representative nuclei segmentation methods assessed on effectiveness (red), robustness (blue), and efficiency (green). For all colored dots, a darker shade denotes better performance. A cross indicates the method is not applicable (NA), and a red-bordered circle signifies a runtime over 100 s.
Figure 8
Figure 8
A Sankey diagram mapping the seven evaluated tools onto the proposed ‘task-target-modality-method’ taxonomy. The flow illustrates the methodological basis (from classical to deep learning) and the operational scope (modality and target) of each algorithm.
Figure 9
Figure 9
A decision-tree recommendation guide for users and developers. The flowchart assists researchers in selecting the optimal segmentation tool based on data modality (e.g. spatial omics versus standard imaging), tissue complexity, and specific task requirements (e.g. speed versus accuracy trade-offs).

References

    1. Tajbakhsh N, Jeyaseelan L, Li Q et al. Embracing imperfect datasets: a review of deep learning solutions for medical image segmentation. Med Image Anal 2020;63:101693. 10.1016/j.media.2020.101693 - DOI - PubMed
    1. Oscanoa J, Doimi F, Dyer R et al. Automated segmentation and classification of cell nuclei in immunohistochemical breast cancer images with estrogen receptor marker. In: Proceedings of the 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Piscataway, NJ: IEEE; 2016. p. 2399–402. 10.1109/EMBC.2016.7591213. - DOI - PubMed
    1. Grishagin IV. Automatic cell counting with ImageJ. Anal Biochem 2015;473:63–5. 10.1016/j.ab.2014.12.007 - DOI - PubMed
    1. Esteva A, Chou K, Yeung S et al. Deep learning-enabled medical computer vision. npj Digital Medicine 2021;4:5. 10.1038/s41746-020-00376-2 - DOI - PMC - PubMed
    1. Minopoulou I, Kleyer A, Yalcin-Mutlu M et al. Imaging in inflammatory arthritis: progress towards precision medicine. Nat Rev Rheumatol 2023;19:650–65. 10.1038/s41584-023-01016-1 - DOI - PubMed

LinkOut - more resources