Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 12;15(1):25205.
doi: 10.1038/s41598-025-06229-w.

Multimodal deep learning for cephalometric landmark detection and treatment prediction

Affiliations

Multimodal deep learning for cephalometric landmark detection and treatment prediction

Fei Gao et al. Sci Rep. .

Abstract

In orthodontics and maxillofacial surgery, accurate cephalometric analysis and treatment outcome prediction are critical for clinical decision-making. Traditional approaches rely on manual landmark identification, which is time-consuming and subject to inter-observer variability, while existing automated methods typically utilize single imaging modalities with limited accuracy. This paper presents DeepFuse, a novel multi-modal deep learning framework that integrates information from lateral cephalograms, CBCT volumes, and digital dental models to simultaneously perform landmark detection and treatment outcome prediction. The framework employs modality-specific encoders, an attention-guided fusion mechanism, and dual-task decoders to leverage complementary information across imaging techniques. Extensive experiments on three clinical datasets demonstrate that DeepFuse achieves a mean radial error of 1.21 mm for landmark detection, representing a 13% improvement over state-of-the-art methods, with a clinical acceptability rate of 92.4% at the 2 mm threshold. For treatment outcome prediction, the framework attains an overall accuracy of 85.6%, significantly outperforming both conventional prediction models and experienced clinicians. The proposed approach enhances diagnostic precision and treatment planning while providing interpretable visualization of decision factors, demonstrating significant potential for clinical integration in orthodontic and maxillofacial practice.

Keywords: Attention mechanism; Cephalometric analysis; Landmark detection; Multi-modal deep learning; Orthodontics; Treatment outcome prediction.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Overall architecture of the DeepFuse framework, illustrating modality-specific encoders (left), attention-guided fusion module (center), and dual-task decoders (right). The framework processes multiple input modalities including lateral cephalograms, CBCT volumes, and digital dental models to simultaneously perform landmark detection and treatment outcome prediction.
Fig. 2
Fig. 2
Multi-source data preprocessing pipeline illustrating modality-specific preprocessing steps including CLAHE for cephalograms, artifact reduction for CBCT, and mesh simplification for dental models, followed by standardization and registration processes.
Fig. 3
Fig. 3
Automated registration flowchart showing landmark-based and intensity-based alignment.
Fig. 4
Fig. 4
Architecture of the multi-modal feature extraction and fusion mechanism in DeepFuse. The figure shows modality-specific encoders (left), cross-modal alignment (center), and the attention-guided fusion module (right) with connections to downstream task-specific decoders.
Fig. 5
Fig. 5
Comparison of mean detection errors (mm) for critical cephalometric landmarks across different methods. DeepFuse consistently demonstrates lower error rates, with particular improvement for traditionally challenging landmarks (PTM, Go, Or).
Fig. 6
Fig. 6
Grad-CAM visualizations for treatment outcome predictions across different case types. Warmer colors indicate regions with greater influence on prediction outcomes.

References

    1. Chen, Y. et al. Validity of digital lateral cephalometry: a systematic review. J. Orthodont.47 (1), 51–66 (2020).
    1. Kapila, S. D. & Nervina, J. M. CBCT in orthodontics: assessment of treatment outcomes and indications for its use. Dentomaxillofacial Radiol.44 (1), 20140282 (2015). - PMC - PubMed
    1. Leonardi, R. et al. Automatic cephalometric analysis: a systematic review. Angle Orthod.78 (1), 145–151 (2008). - PubMed
    1. Proffit, W. R., Fields, H. W. & Sarver, D. M. Contemporary Orthodontics 6th edn 174–208 (Elsevier, 2019).
    1. Trpkova, B. et al. Cephalometric landmarks identification and reproducibility: a meta analysis. Am. J. Orthod. Dentofac. Orthop.112 (2), 165–170 (1997). - PubMed

MeSH terms