Comparative Study

Z Med Phys. 2024 May;34(2):242-257.

doi: 10.1016/j.zemedi.2023.01.008. Epub 2023 Mar 15.

Artificial intelligence-based analysis of whole-body bone scintigraphy: The quest for the optimal deep learning algorithm and comparison with human observer performance

Ghasem Hajianfar¹, Maziar Sabouri², Yazdan Salimi¹, Mehdi Amini¹, Soroush Bagheri³, Elnaz Jenabi⁴, Sepideh Hekmat⁵, Mehdi Maghsudi³, Zahra Mansouri¹, Maziar Khateri⁶, Mohammad Hosein Jamshidi⁷, Esmail Jafari⁸, Ahmad Bitarafan Rajabi³, Majid Assadi⁸, Mehrdad Oveisi⁹, Isaac Shiri¹, Habib Zaidi¹⁰

Affiliations

¹ Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva 4, Switzerland.
² Department of Medical Physics, School of Medicine, Iran University of Medical Sciences, Tehran, Iran; Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran.
³ Rajaie Cardiovascular Medical and Research Center, Iran University of Medical Sciences, Tehran, Iran.
⁴ Research Center for Nuclear Medicine, Shariati Hospital, Tehran University of Medical Sciences, Tehran, Iran.
⁵ Hasheminejad Hospital, Iran University of Medical Sciences, Tehran, Iran.
⁶ Department of Medical Radiation Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.
⁷ Department of Medical Imaging and Radiation Sciences, School of Allied Medical Sciences, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran.
⁸ The Persian Gulf Nuclear Medicine Research Center, Department of Molecular Imaging and Radionuclide Therapy, Bushehr Medical University Hospital, School of Medicine, Bushehr University of Medical Sciences, Bushehr, Iran.
⁹ Department of Computer Science, University of British Columbia, Vancouver, BC, Canada.
¹⁰ Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva 4, Switzerland; Geneva University Neurocenter, Geneva University, Geneva, Switzerland; Department of Nuclear Medicine and Molecular Imaging, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands; Department of Nuclear Medicine, University of Southern Denmark, Odense, Denmark. Electronic address: habib.zaidi@hcuge.ch.

PMID: 36932023
PMCID: PMC11156776
DOI: 10.1016/j.zemedi.2023.01.008

Ghasem Hajianfar et al. Z Med Phys. 2024 May.

Abstract

Purpose: Whole-body bone scintigraphy (WBS) is one of the most widely used modalities for diagnosing malignant bone diseases in their early stages. However, the procedure is time-consuming and requires vigour and experience. Moreover, interpreting WBS scans in the early stages of disease can be challenging because the patterns often resemble a normal appearance, which leaves room for subjective interpretation. To simplify the gruelling, subjective, and error-prone task of interpreting WBS scans, we developed deep learning (DL) models to automate two major analyses, namely (i) classification of scans into normal and abnormal and (ii) discrimination between malignant and non-neoplastic bone diseases, and compared their performance with that of human observers.

Materials and methods: After applying our exclusion criteria to 7188 patients from three different centers, 3772 and 2248 patients were enrolled for the first and second analyses, respectively. Data were split into training and test sets, with a fraction of the training data held out for validation. Ten different CNN models were applied in single- and dual-view (anterior and posterior) input modes to find the optimal model for each analysis. In addition, three different methods, namely squeeze-and-excitation (SE), spatial pyramid pooling (SPP), and attention-augmented (AA), were used to aggregate the features for dual-view input models. Model performance was reported as area under the receiver operating characteristic (ROC) curve (AUC), accuracy, sensitivity, and specificity, and models were compared using the DeLong test applied to the ROC curves. The test dataset was also evaluated by three nuclear medicine physicians (NMPs) with different levels of experience to compare the performance of AI and human observers.
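
As an illustration of the four reported metrics, the sketch below computes AUC, accuracy, sensitivity, and specificity for a binary classifier from labels and predicted scores. It is not the authors' code: the function names (`roc_auc`, `confusion_metrics`), the 0.5 decision threshold, and the toy data are assumptions made for this example only. AUC is computed as the fraction of positive/negative pairs in which the positive case receives the higher score, with ties counted as one half.

```python
from typing import Dict, Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as P(score of a positive > score of a negative); ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def confusion_metrics(
    labels: Sequence[int], scores: Sequence[float], threshold: float = 0.5
) -> Dict[str, float]:
    """Accuracy, sensitivity, and specificity at a fixed (assumed) threshold."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative case")
    return {
        "accuracy": (tp + tn) / len(labels),
        "sensitivity": tp / (tp + fn),  # true-positive rate
        "specificity": tn / (tn + fp),  # true-negative rate
    }


# Toy data (hypothetical): 1 = abnormal scan, 0 = normal scan.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]

auc = roc_auc(labels, scores)
metrics = confusion_metrics(labels, scores)
print(f"AUC={auc:.3f}", metrics)
```

Unlike accuracy, sensitivity, and specificity, AUC does not depend on the chosen threshold, which is one reason it is commonly used to rank competing models.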

Results: DenseNet121_AA (DenseNet121, with dual-view input aggregated by AA) and InceptionResNetV2_SPP achieved the highest performance (AUC = 0.72) for the first and second analyses, respectively. Moreover, on average, in the first analysis, Inception V3 and InceptionResNetV2 CNN models and dual-view input with AA aggregating method had superior performance. In addition, in the second analysis, DenseNet121 and InceptionResNetV2 as CNN methods and dual-view input with AA aggregating method achieved the best results. The performance of AI models was significantly higher than that of human observers in the first analysis, whereas their performance was comparable in the second analysis, although the AI model assessed the scans in drastically less time.

Conclusion: Using the models designed in this study, a positive step can be taken toward improving and optimizing WBS interpretation. By training DL models with larger and more diverse cohorts, AI could potentially be used to assist physicians in the assessment of WBS images.

Keywords: Artificial intelligence; Bone; Deep learning; Scintigraphy; Whole-body.

Conflict of interest statement

Declaration of Competing Interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Figure 1**
Flowchart of inclusion and exclusion criteria.

**Figure 2**
An instance of normal and pathological cases according to nuclear medicine physicians’ reports.

**Figure 3**
Workflow of applied deep learning models. Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.

**Figure 4**
Performance of the various models used in the first analysis in terms of accuracy, AUC, sensitivity, and specificity. Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.

**Figure 5**
(a-e) ROC curves of the best 5 models for the first analysis. (f) Comparison between the ROC curves achieved by nuclear medicine physicians (NMPs) and the DL model achieving the highest AUC (DenseNet121_AA). Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.

**Figure 6**
Performance of the various models used in the second analysis regarding the accuracy, AUC, sensitivity, and specificity. Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.

**Figure 7**
(a-e) ROC curves of the best 5 models for the second analysis. (f) Comparison between the ROC curves achieved by nuclear medicine physicians (NMPs) and the best-performing DL model (InceptionResNetV2_SPP). Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.

**Figure 8**
Model performance is compared using the DeLong test for the first strategy, which is run on the models’ AUCs. The models on columns and rows were evaluated against each other. Light blue: if the row model significantly outperformed the column model in terms of p-value. Purple: if the comparison between the row model and column model yielded a non-significant p-value. Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.

**Figure 9**
Model performance is compared using the DeLong test for the second strategy, which is run on the models’ AUCs. The models on columns and rows were evaluated against each other. Light blue: if the row model significantly outperformed the column model in terms of p-value. Purple: if the comparison between the row model and column model yielded a non-significant p-value. Ant: anterior, Post: posterior, SPP: spatial pyramid pooling, SE: squeeze-and-excitation, AA: attention-augmented.
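
The pairwise comparisons summarized in Figures 8 and 9 rely on the DeLong test for two correlated AUCs. The sketch below is a plain O(m·n) version of that test for two models scored on the same cases; it is an illustration under stated assumptions, not the authors' implementation. The function name `delong_test`, the helper names, and the toy scores are hypothetical, and the two-sided p-value uses a normal approximation.

```python
import math
from typing import List, Sequence, Tuple


def _psi(x: float, y: float) -> float:
    """Pair kernel: 1 if the positive outscores the negative, 0.5 on ties."""
    return 1.0 if x > y else 0.5 if x == y else 0.0


def _components(pos: List[float], neg: List[float]):
    """AUC plus DeLong structural components V10 (per positive), V01 (per negative)."""
    m, n = len(pos), len(neg)
    v10 = [sum(_psi(x, y) for y in neg) / n for x in pos]
    v01 = [sum(_psi(x, y) for x in pos) / m for y in neg]
    return sum(v10) / m, v10, v01


def _cov(a: List[float], b: List[float], mean_a: float, mean_b: float) -> float:
    """Unbiased sample covariance of two equally long lists."""
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / (len(a) - 1)


def delong_test(
    labels: Sequence[int], scores_a: Sequence[float], scores_b: Sequence[float]
) -> Tuple[float, float, float, float]:
    """Return (auc_a, auc_b, z, two-sided p) for two models on the same cases."""
    pos_idx = [i for i, y in enumerate(labels) if y == 1]
    neg_idx = [i for i, y in enumerate(labels) if y == 0]
    m, n = len(pos_idx), len(neg_idx)
    if m < 2 or n < 2:
        raise ValueError("need at least two positive and two negative cases")

    auc_a, v10_a, v01_a = _components(
        [scores_a[i] for i in pos_idx], [scores_a[i] for i in neg_idx]
    )
    auc_b, v10_b, v01_b = _components(
        [scores_b[i] for i in pos_idx], [scores_b[i] for i in neg_idx]
    )

    # Variances of each AUC and their covariance (the mean of V01 equals the AUC).
    var_a = _cov(v10_a, v10_a, auc_a, auc_a) / m + _cov(v01_a, v01_a, auc_a, auc_a) / n
    var_b = _cov(v10_b, v10_b, auc_b, auc_b) / m + _cov(v01_b, v01_b, auc_b, auc_b) / n
    cov_ab = _cov(v10_a, v10_b, auc_a, auc_b) / m + _cov(v01_a, v01_b, auc_a, auc_b) / n

    var_diff = var_a + var_b - 2.0 * cov_ab
    if var_diff <= 0.0:
        raise ValueError("variance of the AUC difference is zero; test undefined")

    z = (auc_a - auc_b) / math.sqrt(var_diff)
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return auc_a, auc_b, z, p


# Toy data (hypothetical): two models scored on the same eight cases.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
model_a = [0.9, 0.8, 0.6, 0.4, 0.7, 0.5, 0.3, 0.2]
model_b = [0.6, 0.7, 0.3, 0.5, 0.8, 0.4, 0.65, 0.2]

auc_a, auc_b, z, p = delong_test(labels, model_a, model_b)
print(f"AUC A={auc_a:.4f}, AUC B={auc_b:.4f}, z={z:.3f}, p={p:.3f}")
```

Because both models are evaluated on the same patients, their AUC estimates are correlated; the covariance term is what distinguishes this test from comparing two independent AUCs.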

Cited by

Application of Nuclear Medicine Techniques in Musculoskeletal Infection: Current Trends and Future Prospects.
Valero-Martínez C, Castillo-Morales V, Gómez-León N, Hernández-Pérez I, Vicente-Rabaneda EF, Uriarte M, Castañeda S. J Clin Med. 2024 Feb 13;13(4):1058. doi: 10.3390/jcm13041058. PMID: 38398371. Free PMC article. Review.
Artificial intelligence in medical physics.
Bollmann S, Küstner T, Tao Q, Zöllner FG. Z Med Phys. 2024 May;34(2):177-178. doi: 10.1016/j.zemedi.2024.03.002. Epub 2024 Mar 23. PMID: 38523040. Free PMC article. No abstract available.
Artificial intelligence-based cardiac transthyretin amyloidosis detection and scoring in scintigraphy imaging: multi-tracer, multi-scanner, and multi-center development and evaluation study.
Salimi Y, Shiri I, Mansouri Z, Sanaat A, Hajianfar G, Hervier E, Bitarafan A, Caobelli F, Hundertmark M, Mainta I, Gräni C, Nkoulou R, Zaidi H. Eur J Nucl Med Mol Imaging. 2025 Jun;52(7):2513-2528. doi: 10.1007/s00259-025-07117-1. Epub 2025 Feb 5. PMID: 39907796. Free PMC article.
Does the tail show when the nose knows? Artificial intelligence outperforms human experts at predicting detection dogs finding their target through tail kinematics.
Martvel G, Pedretti G, Lazebnik T, Zamansky A, Ouchi Y, Monteiro T, Farhat N, Shimshoni I, Michaeli Y, Valsecchi P, Hall N, Marshall-Pescini S, Grinstein D. R Soc Open Sci. 2025 Aug 13;12(8):250399. doi: 10.1098/rsos.250399. eCollection 2025 Aug. PMID: 40809360. Free PMC article.
Clinical performance of deep learning-enhanced ultrafast whole-body scintigraphy in patients with suspected malignancy.
Qi N, Pan B, Meng Q, Yang Y, Ding J, Yuan Z, Gong NJ, Zhao J. BMC Med Imaging. 2024 Sep 9;24(1):236. doi: 10.1186/s12880-024-01422-1. PMID: 39251959. Free PMC article.
