Review

. 2024 Oct;42(10):1100-1109.

doi: 10.1007/s11604-024-01608-1. Epub 2024 Jun 10.

Data set terminology of deep learning in medicine: a historical review and recommendation

Affiliations

¹ Department of Diagnostic and Interventional Radiology, Graduate School of Medicine, Osaka Metropolitan University, Osaka, Japan.
² Sidney Kimmel Cancer Center, Thomas Jefferson University, Philadelphia, PA, USA.
³ Department of Radiology, Juntendo University School of Medicine, Tokyo, Japan.
⁴ Department of Radiology, Nagoya University, Nagoya, Japan.
⁵ Department of Radiology, University of Tokyo Hospital, Tokyo, Japan.
⁶ Department of Diagnostic and Interventional Radiology, Graduate School of Medicine, Osaka Metropolitan University, Osaka, Japan. ai.labo.ocu@gmail.com.
⁷ Department of Artificial Intelligence, Graduate School of Medicine, Osaka Metropolitan University, Osaka, Japan. ai.labo.ocu@gmail.com.
⁸ Center for Health Science Innovation, Osaka Metropolitan University, Osaka, Japan. ai.labo.ocu@gmail.com.

PMID: 38856878
DOI: 10.1007/s11604-024-01608-1

Review

Data set terminology of deep learning in medicine: a historical review and recommendation

Shannon L Walston et al. Jpn J Radiol. 2024 Oct.

. 2024 Oct;42(10):1100-1109.

doi: 10.1007/s11604-024-01608-1. Epub 2024 Jun 10.

Authors

Affiliations

¹ Department of Diagnostic and Interventional Radiology, Graduate School of Medicine, Osaka Metropolitan University, Osaka, Japan.
² Sidney Kimmel Cancer Center, Thomas Jefferson University, Philadelphia, PA, USA.
³ Department of Radiology, Juntendo University School of Medicine, Tokyo, Japan.
⁴ Department of Radiology, Nagoya University, Nagoya, Japan.
⁵ Department of Radiology, University of Tokyo Hospital, Tokyo, Japan.
⁶ Department of Diagnostic and Interventional Radiology, Graduate School of Medicine, Osaka Metropolitan University, Osaka, Japan. ai.labo.ocu@gmail.com.
⁷ Department of Artificial Intelligence, Graduate School of Medicine, Osaka Metropolitan University, Osaka, Japan. ai.labo.ocu@gmail.com.
⁸ Center for Health Science Innovation, Osaka Metropolitan University, Osaka, Japan. ai.labo.ocu@gmail.com.

PMID: 38856878
DOI: 10.1007/s11604-024-01608-1

Abstract

Medicine and deep learning-based artificial intelligence (AI) engineering represent two distinct fields each with decades of published history. The current rapid convergence of deep learning and medicine has led to significant advancements, yet it has also introduced ambiguity regarding data set terms common to both fields, potentially leading to miscommunication and methodological discrepancies. This narrative review aims to give historical context for these terms, accentuate the importance of clarity when these terms are used in medical deep learning contexts, and offer solutions to mitigate misunderstandings by readers from either field. Through an examination of historical documents, including articles, writing guidelines, and textbooks, this review traces the divergent evolution of terms for data sets and their impact. Initially, the discordant interpretations of the word 'validation' in medical and AI contexts are explored. We then show that in the medical field as well, terms traditionally used in the deep learning domain are becoming more common, with the data for creating models referred to as the 'training set', the data for tuning of parameters referred to as the 'validation (or tuning) set', and the data for the evaluation of models as the 'test set'. Additionally, the test sets used for model evaluation are classified into internal (random splitting, cross-validation, and leave-one-out) sets and external (temporal and geographic) sets. This review then identifies often misunderstood terms and proposes pragmatic solutions to mitigate terminological confusion in the field of deep learning in medicine. We support the accurate and standardized description of these data sets and the explicit definition of data set splitting terminologies in each publication. These are crucial methods for demonstrating the robustness and generalizability of deep learning applications in medicine. This review aspires to enhance the precision of communication, thereby fostering more effective and transparent research methodologies in this interdisciplinary field.

Keywords: Artificial intelligence; Data partition; Data splitting; Deep learning; Terminology.

PubMed Disclaimer

Cited by

Large multimodality model fine-tuned for detecting breast and esophageal carcinomas on CT: a preliminary study.
Yasaka K, Kawamura M, Sonoda Y, Kubo T, Kiryu S, Abe O. Yasaka K, et al. Jpn J Radiol. 2025 May;43(5):779-786. doi: 10.1007/s11604-024-01718-w. Epub 2024 Dec 13. Jpn J Radiol. 2025. PMID: 39668277 Free PMC article.
Deep learning for appendicitis: development of a three-dimensional localization model on CT.
Takaishi T, Kawai T, Kokubo Y, Fujinaga T, Ojio Y, Yamamoto T, Hayashi K, Owatari Y, Ito H, Hiwatashi A. Takaishi T, et al. Jpn J Radiol. 2025 Jul 16. doi: 10.1007/s11604-025-01834-1. Online ahead of print. Jpn J Radiol. 2025. PMID: 40668351
Classification of Interventional Radiology Reports into Technique Categories with a Fine-Tuned Large Language Model.
Yasaka K, Nomura T, Kamohara J, Hirakawa H, Kubo T, Kiryu S, Abe O. Yasaka K, et al. J Imaging Inform Med. 2024 Dec 13. doi: 10.1007/s10278-024-01370-w. Online ahead of print. J Imaging Inform Med. 2024. PMID: 39673010
Generation of high-resolution MPRAGE-like images from 3D head MRI localizer (AutoAlign Head) images using a deep learning-based model.
Tagawa H, Fushimi Y, Fujimoto K, Nakajima S, Okuchi S, Sakata A, Otani S, Wicaksono KP, Wang Y, Ikeda S, Ito S, Umehana M, Shimotake A, Kuzuya A, Nakamoto Y. Tagawa H, et al. Jpn J Radiol. 2025 May;43(5):761-769. doi: 10.1007/s11604-024-01728-8. Epub 2025 Jan 11. Jpn J Radiol. 2025. PMID: 39794660 Free PMC article.
Applications of artificial intelligence in interventional oncology: An up-to-date review of the literature.
Matsui Y, Ueda D, Fujita S, Fushimi Y, Tsuboyama T, Kamagata K, Ito R, Yanagawa M, Yamada A, Kawamura M, Nakaura T, Fujima N, Nozaki T, Tatsugami F, Fujioka T, Hirata K, Naganawa S. Matsui Y, et al. Jpn J Radiol. 2025 Feb;43(2):164-176. doi: 10.1007/s11604-024-01668-3. Epub 2024 Oct 2. Jpn J Radiol. 2025. PMID: 39356439 Free PMC article. Review.

See all "Cited by" articles

References

1. Ueda D, Kakinuma T, Fujita S, Kamagata K, Fushimi Y, Ito R, et al. Fairness of artificial intelligence in healthcare: review and recommendations. Jpn J Radiol. 2023;42:3–15. - PubMed - PMC - DOI
1. Yoshiura T, Kiryu S. FAIR: a recipe for ensuring fairness in healthcare artificial intelligence. Jpn J Radiol. 2024;42:1–2. - PubMed - DOI
1. Kawamura M, Kamomae T, Yanagawa M, Kamagata K, Fujita S, Ueda D, et al. Revolutionizing radiation therapy: the role of AI in clinical practice. J Radiat Res. 2023. https://doi.org/10.1093/jrr/rrad090 . - DOI - PubMed - PMC
1. Fujima N, Kamagata K, Ueda D, Fujita S, Fushimi Y, Yanagawa M, et al. Current state of artificial intelligence in clinical applications for head and neck MR imaging. Magn Reson Med Sci. 2023;22:401–14. - PubMed - PMC - DOI
1. Yanagawa M, Ito R, Nozaki T, Fujioka T, Yamada A, Fujita S, et al. New trend in artificial intelligence-based assistive technology for thoracic imaging. Radiol Med. 2023;128:1236–49. - PubMed - PMC - DOI

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Springer

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Data set terminology of deep learning in medicine: a historical review and recommendation

Affiliations

Data set terminology of deep learning in medicine: a historical review and recommendation

Authors

Affiliations

Abstract

Similar articles

Cited by

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Abstract

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

LinkOut - more resources

Full Text Sources