Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 11;15(1):25033.
doi: 10.1038/s41598-025-09522-w.

A hybrid YOLO-UNet3D framework for automated protein particle annotation in Cryo-ET images

Affiliations

A hybrid YOLO-UNet3D framework for automated protein particle annotation in Cryo-ET images

Ziyang Liu et al. Sci Rep. .

Abstract

Accurate localization and identification of protein complexes in cryo-electron tomography (cryo-ET) volumes are essential for understanding cellular functions and disease mechanisms. However, automated annotation of these macromolecular assemblies remains challenging due to low signal-to-noise ratios, missing wedge artifacts, heterogeneous backgrounds, and structural diversity. In this study, we present a hybrid framework integrating You Only Look Once (YOLO) object detection with UNet3D volumetric segmentation, enhanced by density-based spatial clustering of applications with noise (DBSCAN) post-processing for automated protein particle annotation in cryo-ET volumes. Our approach combines YOLO's efficient region proposal capabilities with UNet3D's powerful 3D feature extraction through a dual-branch architecture featuring optimized Spatial Pyramid Pooling-Fast (SPPF) modules and asymmetric feature splitting. Extensive experiments on the Chan Zuckerberg Initiative Imaging (CZII) cryo-ET dataset demonstrate that our method significantly outperforms existing state-of-the-art approaches, including DeepFinder, standard UNet3D, YOLOv5-3D, and 3D ResNet models, achieving a mean recall of 0.8848 and F4-score of 0.7969. The framework demonstrates robust performance across various protein particle types and imaging conditions, offering a promising technical solution for high-throughput structural biology workflows requiring accurate macromolecular annotation in cellular cryo-ET data.

Keywords: Cryo-electron tomography; Deep learning; Density clustering; Protein particle detection; UNet3D; YOLO.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Overview of the 3D Cryo-ET particle detection system. The figure shows: (Left) The complete processing pipeline for 3D Cryo-ET tomogram data, including 2D slice extraction (640formula image640), resizing operations, and YOLO-based detection with visualization of particle identification results; (a) The modified YOLO detection network architecture featuring SPPF (Spatial Pyramid Pooling Fast) modules, C2f blocks, convolution layers, and upsampling operations, culminating in the detection module for 3D coordinate and particle classification output; (b) The feature extraction and processing network showing multi-scale feature maps (64, 128, 256, 512 channels) with RCSP (Residual CSP) modules, CBM4 blocks, Conv2D layers, and FPA (Feature Pyramid Aggregation) for comprehensive particle feature analysis.
Fig. 2
Fig. 2
Key Building Blocks of the Proposed Network. (a) Residual Unit (ResUnit): Architecture of the residual unit used in the UNet3D branch. (b) Asymmetric Split-and-Fuse Block (C2f Module):Structure of the asymmetric C2f module with a 1:2 feature split and dense convolutional path. (c) Fast Spatial Pyramid Pooling (SPPF) Module: Layout of the multi-branch SPPF module with adaptive max pooling for multi-scale feature fusion.
Fig. 3
Fig. 3
Central slab from a representative tomogram of the in vitro sample with selected set of annotations. Different colors indicate different particle types.
Fig. 4
Fig. 4
Visualization of dataset structure. (a) shows tomographic image slices; (b) and (c) display 3D scatter plots of protein particle annotations in two tomograms. These visualizations highlight the volumetric complexity and distribution diversity present in the dataset.
Fig. 5
Fig. 5
Visualization of feature maps at different stages of the model. The enhanced SPPF module enables effective multi-scale feature extraction, particularly beneficial for particles of varying sizes.
Fig. 6
Fig. 6
Analysis of model performance under different noise levels and imaging conditions. The model maintains robust performance even under challenging conditions, demonstrating its practical applicability.

Similar articles

References

    1. Galaz-Montoya, J. The advent of preventive high-resolution structural histopathology by artificial-intelligence-powered cryogenic electron tomography. Front. Mol. Biosci.11, 1390858 (2024). - PMC - PubMed
    1. Chien, C.-T., Maduke, M. & Chiu, W. Single-particle cryogenic electron microscopy structure determination for membrane proteins. Curr. Opin. Struct. Biol.92, 103047 (2025). - PMC - PubMed
    1. Tegunov, D., Xue, L., Dienemann, C., Cramer, P. & Mahamid, J. Multi-particle cryo-em refinement with M visualizes ribosome-antibiotic complex at 3.5 å in cells. Nat. Methods18, 186–193 (2021). - PMC - PubMed
    1. Rose, K. et al. In situ cryo-et visualization of mitochondrial depolarization and mitophagic engulfment. bioRxiv 2025–03 (2025)
    1. Kapnulin, L., Heimowitz, A. & Sharon, N. Outlier removal in cryo-em via radial profiles. J. Struct. Biol. 108172 (2025) - PubMed

MeSH terms

LinkOut - more resources