Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr;38(2):1112-1119.
doi: 10.1007/s10278-024-01248-x. Epub 2024 Sep 4.

Improving the Annotation Process in Computational Pathology: A Pilot Study with Manual and Semi-automated Approaches on Consumer and Medical Grade Devices

Affiliations

Improving the Annotation Process in Computational Pathology: A Pilot Study with Manual and Semi-automated Approaches on Consumer and Medical Grade Devices

Giorgio Cazzaniga et al. J Imaging Inform Med. 2025 Apr.

Abstract

The development of reliable artificial intelligence (AI) algorithms in pathology often depends on ground truth provided by annotation of whole slide images (WSI), a time-consuming and operator-dependent process. A comparative analysis of different annotation approaches is performed to streamline this process. Two pathologists annotated renal tissue using semi-automated (Segment Anything Model, SAM)) and manual devices (touchpad vs mouse). A comparison was conducted in terms of working time, reproducibility (overlap fraction), and precision (0 to 10 accuracy rated by two expert nephropathologists) among different methods and operators. The impact of different displays on mouse performance was evaluated. Annotations focused on three tissue compartments: tubules (57 annotations), glomeruli (53 annotations), and arteries (58 annotations). The semi-automatic approach was the fastest and had the least inter-observer variability, averaging 13.6 ± 0.2 min with a difference (Δ) of 2%, followed by the mouse (29.9 ± 10.2, Δ = 24%), and the touchpad (47.5 ± 19.6 min, Δ = 45%). The highest reproducibility in tubules and glomeruli was achieved with SAM (overlap values of 1 and 0.99 compared to 0.97 for the mouse and 0.94 and 0.93 for the touchpad), though SAM had lower reproducibility in arteries (overlap value of 0.89 compared to 0.94 for both the mouse and touchpad). No precision differences were observed between operators (p = 0.59). Using non-medical monitors increased annotation times by 6.1%. The future employment of semi-automated and AI-assisted approaches can significantly speed up the annotation process, improving the ground truth for AI tool development.

Keywords: Annotation; Artificial intelligence; Computational pathology; Digital pathology; Segment Anything Model.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethical Approval: Approval was obtained from the local ethics committee (PNRR-MR1-2022–12375735, 03/16/23). Conflict of Interest: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Graphic representation of the study design. Two pathologists, one with experience in annotation and the other at its beginner phase with this task, were asked to annotate three different renal structures (tubules, glomeruli, and arteries) using two manual devices (mouse vs touchpad) and a semi-automatic, AI-based tool (SAM). Starting from the masks obtained with this annotation process, performances in terms of working time required, inter-observer variability, and annotation precision were calculated, comparing the results obtained with the mouse using medical and non-medical displays
Fig. 2
Fig. 2
The Qupath extension for SAM allows the user to draw a bounding box around the structure of interest and let the model finely annotate it
Fig. 3
Fig. 3
The inter-annotator average times required for the annotation of different structures (tubules, violet; glomeruli, pink; arteries, light pink) using different devices (touchpad, mouse, and SAM) (A). Time variability between the two annotators is reported in %, using the same stratification (B). Finally, the reproducibility in terms of overlap fraction for ROIs obtained with different methods (C) and from the two observers (D) is reported
Fig. 4
Fig. 4
Comparison of the annotation outlines for tubules, glomeruli, and arteries obtained with the different devices (touchpad, mouse, and SAM), with relative overlap of the ROIs obtained

Similar articles

Cited by

References

    1. Hanna, M. G., Ardon, O., Reuter, V. E., Sirintrapun, S. J., England, C., Klimstra, D. S. & Hameed, M. R. Integrating digital pathology into clinical practice. Mod. Pathol.35, 152–164 (2022). - PubMed
    1. Pisapia, P., L’Imperio, V., Galuppini, F., Sajjadi, E., Russo, A., Cerbelli, B., Fraggetta, F., d’Amati, G., Troncone, G., Fassan, M., Fusco, N., Pagni, F. & Malapelle, U. The evolving landscape of anatomic pathology. Crit. Rev. Oncol. Hematol.178, 103776 (2022). - PubMed
    1. L’Imperio, V., Casati, G., Cazzaniga, G., Tarabini, A., Bolognesi, M. M., Gibilisco, F., Fraggetta, F. & Pagni, F. Improvements in digital pathology equipment for renal biopsies: updating the standard model. J. Nephrol. (2023). 10.1007/s40620-023-01568-1 - PubMed
    1. Cazzaniga, G., Rossi, M., Eccher, A., Girolami, I., L’Imperio, V., Van Nguyen, H., Becker, J. U., Bueno García, M. G., Sbaraglia, M., Dei Tos, A. P., Gambaro, G. & Pagni, F. Time for a full digital approach in nephropathology: a systematic review of current artificial intelligence applications and future directions. J. Nephrol. (2023). 10.1007/s40620-023-01775-w - PMC - PubMed
    1. Niazi, M. K. K., Parwani, A. V. & Gurcan, M. N. Digital pathology and artificial intelligence. Lancet Oncol.20, e253–e261 (2019). - PMC - PubMed

LinkOut - more resources