. 2025 Jul 29:6:1634006.

doi: 10.3389/fdmed.2025.1634006. eCollection 2025.

Evaluating the accuracy of generative artificial intelligence models in dental age estimation based on the Demirjian's method

Affiliations

¹ Post-Graduation Program in Health and Environment, University from the Joinville Region - Univille, Joinville, Brazil.
² School of Dentistry, Tuiuti University of Paraná - UTP, Curitiba, Brazil.
³ Department of Biomaterials, University of Uberaba - UNIUBE, Uberaba, Brazil.
⁴ Department of Orthodontics, University Hospital Bonn, Medical Faculty, Bonn, Germany.
⁵ Postgraduate Program in Dentistry, Health Institute of Nova Friburgo, Fluminense Federal University, Niterói, Rio de Janeiro, Brazil.

PMID: 40800006
PMCID: PMC12339434
DOI: 10.3389/fdmed.2025.1634006

Evaluating the accuracy of generative artificial intelligence models in dental age estimation based on the Demirjian's method

Allan Abuabara et al. Front Dent Med. 2025.

. 2025 Jul 29:6:1634006.

doi: 10.3389/fdmed.2025.1634006. eCollection 2025.

Authors

Affiliations

¹ Post-Graduation Program in Health and Environment, University from the Joinville Region - Univille, Joinville, Brazil.
² School of Dentistry, Tuiuti University of Paraná - UTP, Curitiba, Brazil.
³ Department of Biomaterials, University of Uberaba - UNIUBE, Uberaba, Brazil.
⁴ Department of Orthodontics, University Hospital Bonn, Medical Faculty, Bonn, Germany.
⁵ Postgraduate Program in Dentistry, Health Institute of Nova Friburgo, Fluminense Federal University, Niterói, Rio de Janeiro, Brazil.

PMID: 40800006
PMCID: PMC12339434
DOI: 10.3389/fdmed.2025.1634006

Abstract

Introduction: Dental age estimation plays a key role in forensic identification, clinical diagnosis, treatment planning, and prognosis in fields such as pediatric dentistry and orthodontics. Large language models (LLM) are increasingly being recognized for their potential applications in Dentistry. This study aimed to compare the performance of currently available generative artificial intelligence LLM technologies in estimating dental age using the Demirjian's scores.

Methods: Panoramic radiographs were analyzed using Demirjian's method (1973), with each left permanent mandibular tooth classified from stage A to H. Untrained LLM, ChatGPT (GPT-4-turbo), Gemini 2.0 Flash, and DeepSeek-V3 were tasked with estimating dental age based on the patient's Demirjian score for each tooth. Due to the probabilistic nature of ChatGPT, Gemini, and DeepSeek, which can produce varying responses to the same question, three responses were collected per case per day (three different computers) from each model on three separate days. The age estimates obtained from LLM were compared to the individuals' chronological ages. Intra- and inter-examiner reliability was assessed using the Intraclass Correlation Coefficient (ICC). Model performance was evaluated using Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Coefficient of Determination (R ²), and Bias.

Results: Thirty panoramic radiographs (40% female, 60% male; mean age 10.4 ± 2.32 years) were included. Both intra- and inter-examiner ICC values exceeded 0.85. ChatGPT and DeepSeek exhibited comparable but suboptimal performance, with higher errors (MAE: 1.98-2.05 years; RMSE: 2.33-2.35 years), negative R ² values (-0.069 to -0.049), and substantial overestimation biases (1.90-1.91 years), indicating poor model fit and systematic flaws. Gemini demonstrated intermediate results, with a moderate MAE (1.57 years) and RMSE (1.81 years), a positive R ² (0.367), and a lower bias (1.32 years).

Discussion: This study demonstrated that, although LLM like ChatGPT, Gemini, and DeepSeek can estimate dental age using Demirjian's scores, their performance remains inferior to the traditional method. Among them, DeepSeek-V3 showed the best results, but all models require task-specific training and validation before clinical application.

Keywords: age determination by teeth; artificial intelligence; clinical decision-making; evidence-based dentistry; generative artificial intelligence; large language models.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

**Figure 1**
Workflow for evaluating large language models (LLM) in dental age estimation using the Demirjian's method. Models were prompted and compared (ChatGPT, Gemini, DeepSeek), with performance assessed by Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Coefficient of Determination (R²), and bias. AI: Artificial Intelligence.

**Figure 2**
Scatter plot comparing chronological age and predicted dental age estimated by the Demirjian's method and three LLM (ChatGPT, Gemini, and DeepSeek). The red dashed line represents the ideal prediction (y = x).

See this image and copyright information in PMC

References

1. Kirschneck C, Proff P. Age assessment in orthodontics and general dentistry. Quintessence Int. (2018) 49(4):313–23. 10.3290/j.qi.a39960 - DOI - PubMed
1. Shen S, Liu Z, Wang J, Fan L, Ji F, Tao J. Machine learning assisted Cameriere method for dental age estimation. BMC Oral Health. (2021) 21(1):641. 10.1186/s12903-021-01996-0 - DOI - PMC - PubMed
1. Cortés MM P, Rojo R, Alía García E, Mourelle Martínez MR. Accuracy assessment of dental age estimation with the Willems, Demirjian and Nolla methods in Spanish children: comparative cross-sectional study. BMC Pediatr. (2020) 20(1):361. 10.1186/s12887-020-02247-x - DOI - PMC - PubMed
1. Demirjian A, Goldstein H, Tanner JM. A new system of dental age assessment. Hum Biol. (1973) 45(2):211–27. - PubMed
1. Zheng J, Ding X, Pu JJ, Chung SM, Ai QYH, Hung KF, et al. Unlocking the potentials of large language models in orthodontics: a scoping review. Bioengineering (Basel. (2024) 11(11):1145. 10.3390/bioengineering11111145 - DOI - PMC - PubMed

Associated data

figshare/10.6084/m9.figshare.29045501.v1

LinkOut - more resources

Full Text Sources
- Frontiers Media SA
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Evaluating the accuracy of generative artificial intelligence models in dental age estimation based on the Demirjian's method

Affiliations

Evaluating the accuracy of generative artificial intelligence models in dental age estimation based on the Demirjian's method

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

References

Associated data

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

References

Associated data

Related information

LinkOut - more resources

Full Text Sources