Front Bioeng Biotechnol. 2025 May 8:13:1580502.

doi: 10.3389/fbioe.2025.1580502. eCollection 2025.

Deep ensemble learning-driven fully automated multi-structure segmentation for precision craniomaxillofacial surgery

Jiahao Bao^#¹, Zongcai Tan^#², Yifeng Sun^#³, Xinyu Xu^#⁴, Huazhen Liu⁴, Weiyi Cui¹, Yang Yang⁵, Mengjia Cheng⁶, Yiming Wang¹, Congshuang Ku¹, Yuen Ka Ho¹, Jiayi Zhu¹, Linfeng Fan^#⁷, Dahong Qian^#⁸, Shunyao Shen^#¹, Yaofeng Wen^#⁸, Hongbo Yu^#¹

Affiliations

¹ Department of Oral and Craniomaxillofacial Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, College of Stomatology, Shanghai Jiao Tong University, National Center for Stomatology, National Clinical Research Center for Oral Diseases, Shanghai Research Institute of Stomatology, Shanghai Key Laboratory of Stomatology, Shanghai, China.
² Hamlyn Centre for Robotic Surgery, Institute of Global Health Innovation, Imperial College London, London, United Kingdom.
³ School of Mechanical Engineering, Shanghai Dianji University, Shanghai, China.
⁴ School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China.
⁵ Shanghai Lanhui Medical Technology Co., Ltd., Shanghai, China.
⁶ Faculty of Dentistry, The University of Hong Kong, Hong Kong, Hong Kong SAR, China.
⁷ Department of Radiology, Shanghai Ninth People's Hospital, College of Stomatology, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
⁸ School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China.

^# Contributed equally.

PMID: 40406586
PMCID: PMC12094958
DOI: 10.3389/fbioe.2025.1580502

Jiahao Bao et al. Front Bioeng Biotechnol. 2025.

Abstract

Objectives: Accurate segmentation of craniomaxillofacial (CMF) structures and individual teeth is essential for advancing computer-assisted CMF surgery. This study developed CMF-ELSeg, a novel fully automatic multi-structure segmentation model based on deep ensemble learning.

Methods: A total of 143 CMF computed tomography (CT) scans were retrospectively collected and manually annotated by experts for model training and validation. Three 3D U-Net-based deep learning models (V-Net, nnU-Net, and 3D UX-Net) were benchmarked. CMF-ELSeg employed a coarse-to-fine cascaded architecture and an ensemble approach to integrate the strengths of these models. Segmentation performance was evaluated using Dice score and Intersection over Union (IoU) by comparing model predictions to ground truth annotations. Clinical feasibility was assessed through qualitative and quantitative analyses.
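As a rough illustration of the evaluation described above, the following minimal sketch computes the Dice score and IoU for one label in a pair of integer label volumes, and fuses several model outputs by voxel-wise majority vote. The function names (`dice_and_iou`, `majority_vote`), the toy volumes, and the empty-label convention are assumptions made for this example. The abstract does not state which fusion rule CMF-ELSeg uses, so majority voting stands in here only as one common ensemble strategy, not as the authors' method.

```python
import numpy as np


def dice_and_iou(pred, truth, label):
    """Return (Dice, IoU) for one label in two integer label volumes."""
    p = pred == label
    t = truth == label
    inter = np.logical_and(p, t).sum()
    total = p.sum() + t.sum()
    union = total - inter
    # Assumed convention: a label absent from both volumes scores 1.0.
    dice = 2.0 * inter / total if total > 0 else 1.0
    iou = inter / union if union > 0 else 1.0
    return float(dice), float(iou)


def majority_vote(predictions, num_labels):
    """Voxel-wise majority vote over label volumes of identical shape."""
    stacked = np.stack(predictions)  # shape: (n_models, *volume_shape)
    # Count, for every voxel, how many models voted for each label.
    votes = np.stack([(stacked == k).sum(axis=0) for k in range(num_labels)])
    return votes.argmax(axis=0)  # ties resolve to the lowest label index


# Toy data (hypothetical): one 2x2x2 foreground cube in a 4x4x4 volume.
truth = np.zeros((4, 4, 4), dtype=int)
truth[1:3, 1:3, 1:3] = 1

pred_a = truth.copy()
pred_a[1, 1, 1] = 0  # one missed foreground voxel
pred_b = truth.copy()
pred_b[0, 0, 0] = 1  # one spurious foreground voxel
pred_c = truth.copy()  # perfect prediction

fused = majority_vote([pred_a, pred_b, pred_c], num_labels=2)

for name, pred in [("a", pred_a), ("b", pred_b), ("c", pred_c), ("fused", fused)]:
    d, i = dice_and_iou(pred, truth, label=1)
    print(f"{name}: Dice={d:.3f} IoU={i:.3f}")
```

In this toy case the single-voxel errors of models `a` and `b` occur at different locations, so the vote cancels them out. Real ensembles only benefit in this way when the member models make sufficiently uncorrelated errors.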

Results: In coarse segmentation of the upper skull, mandible, cervical vertebra, and pharyngeal cavity, 3D UX-Net and nnU-Net achieved Dice scores above 0.96 and IoU above 0.93. For fine segmentation and classification of individual teeth, the cascaded 3D UX-Net performed best. CMF-ELSeg improved Dice scores by 3%-5% over individual models for facial soft tissue, upper skull, mandible, cervical vertebra, and pharyngeal cavity segmentation, and maintained high accuracy (Dice > 0.94) for most teeth. Clinical evaluation confirmed that CMF-ELSeg performed reliably in patients with skeletal malocclusion, fractures, and fibrous dysplasia.
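For readers comparing the two reported metrics, the standard definitions imply a fixed relationship between Dice and IoU for any single pair of masks. The short derivation below uses the symbols $A$ (predicted mask) and $B$ (ground-truth mask), which are notation introduced here rather than taken from the paper.

```latex
\mathrm{Dice} = \frac{2\,|A \cap B|}{|A| + |B|}, \qquad
\mathrm{IoU} = \frac{|A \cap B|}{|A \cup B|}, \qquad
|A \cup B| = |A| + |B| - |A \cap B|
\;\;\Longrightarrow\;\;
\mathrm{IoU} = \frac{\mathrm{Dice}}{2 - \mathrm{Dice}}

% Worked example for a single case:
\mathrm{Dice} = 0.96 \;\Rightarrow\; \mathrm{IoU} = \frac{0.96}{1.04} \approx 0.923
```

The identity holds per case, not for values averaged over a cohort. Because $d \mapsto d/(2-d)$ is convex on $[0, 1]$, a mean IoU can be somewhat higher than the value obtained by applying the formula to the mean Dice.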

Conclusion: CMF-ELSeg provides high-precision segmentation of CMF structures and teeth by leveraging multiple models, serving as a practical tool for clinical applications and enhancing patient-specific treatment planning in CMF surgery.

Keywords: computed tomography; craniomaxillofacial surgery; deep learning; segmentation; virtual surgical planning.

Conflict of interest statement

Author YY was employed by Shanghai Lanhui Medical Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**FIGURE 1**
Overview of the study design.

**FIGURE 2**
Architecture of our proposed deep learning model.

**FIGURE 3**
Quantitative analysis results for segmentation performance. **(A)** Dice scores for segmentation performance of CMF structures using V-Net, nnU-Net, and 3D UX-Net. **(B)** Dice scores for segmentation performance of individual teeth using cascaded segmentation networks based on V-Net, nnU-Net, and 3D UX-Net.

**FIGURE 4**
Segmentation performance of CMF-ELSeg. **(A)** Quantitative analysis results for segmentation performance of CMF structures and individual teeth. **(B)** Comparison of segmentation performance between CMF-ELSeg and the baseline models. *P < 0.05.

**FIGURE 5**
Segmentation and 3D reconstruction results of CMF structures and individual teeth using CMF-ELSeg. **(A,B)** Segmentation results illustrated for two representative cases. **(C,D)** 3D reconstruction results illustrated for two representative cases. Case 1: a skeletal class III malocclusion patient with orthodontic brackets. Case 2: a patient who has undergone orthognathic surgery.

**FIGURE 6**
Segmentation results of individual teeth using CMF-ELSeg and individual cascaded segmentation network. Case 1: a skeletal class III malocclusion patient with orthodontic brackets. Case 2: a patient who has undergone orthognathic surgery.

**FIGURE 7**
3D reconstruction results and surface deviations of individual teeth using CMF-ELSeg and individual cascaded segmentation network. Case 1: a skeletal class III malocclusion patient with orthodontic brackets. Case 2: a patient who has undergone orthognathic surgery.

**FIGURE 8**
Clinical feasibility evaluation of CMF-ELSeg. **(A)** An example of the segmentation and reconstruction results using CMF-ELSeg for patients with skeletal malocclusion. **(B)** The qualitative analysis results of CMF-ELSeg in Cohort 2. **(C)** Quantitative analysis results of CMF-ELSeg in Cohort 2. **(D)** The composition of patients in Cohort 3. **(E)** The qualitative analysis results of CMF-ELSeg in Cohort 3. **(F)** Segmentation and reconstruction cases of CMF-ELSeg in Cohort 3.
