Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov;115(11):3695-3704.
doi: 10.1111/cas.16330. Epub 2024 Sep 2.

High-quality expert annotations enhance artificial intelligence model accuracy for osteosarcoma X-ray diagnosis

Affiliations

High-quality expert annotations enhance artificial intelligence model accuracy for osteosarcoma X-ray diagnosis

Joe Hasei et al. Cancer Sci. 2024 Nov.

Abstract

Primary malignant bone tumors, such as osteosarcoma, significantly affect the pediatric and young adult populations, necessitating early diagnosis for effective treatment. This study developed a high-performance artificial intelligence (AI) model to detect osteosarcoma from X-ray images using highly accurate annotated data to improve diagnostic accuracy at initial consultations. Traditional models trained on unannotated data have shown limited success, with sensitivities of approximately 60%-70%. In contrast, our model used a data-centric approach with annotations from an experienced oncologist, achieving a sensitivity of 95.52%, specificity of 96.21%, and an area under the curve of 0.989. The model was trained using 468 X-ray images from 31 osteosarcoma cases and 378 normal knee images with a strategy to maximize diversity in the training and validation sets. It was evaluated using an independent dataset of 268 osteosarcoma and 554 normal knee images to ensure generalizability. By applying the U-net architecture and advanced image processing techniques such as renormalization and affine transformations, our AI model outperforms existing models, reducing missed diagnoses and enhancing patient outcomes by facilitating earlier treatment. This study highlights the importance of high-quality training data and advocates a shift towards data-centric AI development in medical imaging. These insights can be extended to other rare cancers and diseases, underscoring the potential of AI in transforming diagnostic processes in oncology. The integration of this AI model into clinical workflows could support physicians in early osteosarcoma detection, thereby improving diagnostic accuracy and patient care.

Keywords: artificial intelligence; clinical decision support; diagnostic imaging; image annotation; osteosarcoma detection.

PubMed Disclaimer

Conflict of interest statement

The author, Joe Hasei, is affiliated with an endowed chair funded by Plusman LLC (Tokyo, Japan). This article was written as part of his role at the institution and reflects the academic and scientific standards and policies of the university. Joe Hasei has no financial relationships or interests to disclose that are directly relevant to the content of this study aside from institutional support. The other authors have no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
Illustration of the network architecture used in our study, based on U‐Net shaped convolutional neural networks. The architecture processes a 2D input image through several steps, including pixel de‐shuffle before the first convolution and pixel shuffle after the last convolution to reduce the total number of network parameters. Convolutional operations are performed in each unit, referred to as a “Res block,” which is similar to a residual block with a skip connection. The network output employs multiresolution maps, akin to the U‐Net architecture.
FIGURE 2
FIGURE 2
Evaluation of artificial intelligence (AI)‐based lesion detection in X‐ray images. (A) Score distribution. Histogram showing the frequency distribution of lesion detection probabilities assigned by the AI model to the test set of X‐ray images, with an overlaid kernel density estimate indicating the score distribution pattern. (B) Receiver operating characteristic (ROC) curve. The ROC curve for the AI model, depicting the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. The area under the curve (AUC) is 0.989, illustrating the diagnostic accuracy of the model.
FIGURE 3
FIGURE 3
Osteosarcoma radiographs of the distal femur with artificial intelligence (AI) analysis results. (A) Original radiograph. X‐ray image of a patient's knee with osteosarcoma in the distal femur, where the tumor may be difficult to detect by untrained observers. (B) AI‐processed radiograph. X‐ray image with superimposed lesion detection; the red outline marks the tumor boundary as determined by expert human annotation, and the green outline shows the lesion area as identified by the AI model, highlighting the utility of the model in detecting subtle osteosarcoma manifestations. The lesion detection score of the AI model was 0.93, and the Dice coefficient was 0.63.
FIGURE 4
FIGURE 4
Scatter plot illustrating the correlation between lesion detection scores using artificial intelligence (AI) and Dice coefficients. The plot maps the lesion detection score assigned by the AI (x‐axis) against the Dice coefficient (y‐axis), which measures the agreement between the AI‐detected lesions and the expert‐annotated ground truth. A significant positive correlation (Pearson's correlation, 0.68; p < 0.001) shows that higher detection probabilities were generally associated with higher Dice coefficients, indicating reliable lesion identification using AI.
FIGURE 5
FIGURE 5
Artificial intelligence (AI) segmentation results with high and low Dice coefficients. For each case (A, B): left, original radiograph of the distal femur showing an osteosarcoma lesion; right, AI‐processed image with superimposed lesion detection. The red outline indicates the tumor boundary, as determined by expert human annotation, whereas the green outline represents the lesion area identified by the AI model. (A) High Dice coefficient case. The lesion detection score of the AI model was 0.99, and the Dice coefficient was 0.94, showing close agreement between AI segmentation and expert annotation. (B) Low Dice coefficient case. The AI model's lesion detection score was 0.82 and the Dice coefficient was 0.46. Despite the lower Dice coefficients, the AI model still identifies the primary area of concern, indicating its ability to detect subtle lesions even when full segmentation is challenging.
FIGURE 6
FIGURE 6
Osteosarcoma radiographs of the distal femur on whole lower extremity images and artificial intelligence (AI) reading results. (A) Original radiograph. A comprehensive full‐leg X‐ray depicting the distal femur, where a lesion is present but may be difficult to discern due to its small size in the context of the whole limb. (B) AI‐enhanced radiograph. X‐rays with AI detection overlays. The red contour represents the expert‐determined lesion boundary, while the green contour indicates the AI‐identified osteosarcoma lesion. The green markings in the pelvic area, which do not correspond to the red outline, indicate regions where the AI incorrectly identified intestinal shadows as potential osteosarcoma lesions.

References

    1. Savage SA, Mirabello L. Using epidemiology and genomics to understand osteosarcoma etiology. Sarcoma. 2011;2011:548151. - PMC - PubMed
    1. Stiller CA, Bielack SS, Jundt G, Steliarova‐Foucher E. Bone tumours in European children and adolescents, 1978–1997. Report from the automated childhood cancer information system project. Eur J Cancer. 2006;42:2124‐2135. - PubMed
    1. Gardner MH, Barnes MJ, Bopanna S, et al. Barriers to the use of psychosocial support services among adolescent and young adult survivors of pediatric cancer. J Adolesc Young Adult Oncol. 2014;3:112‐116. - PMC - PubMed
    1. Bleyer A, Barr R, Ries L, Whelan J, Ferrari A. Cancer in Adolescents and Young Adults. Springer; 2016. - PubMed
    1. Zebrack BJ, Block R, Hayes‐Lattin B, et al. Psychosocial service use and unmet need among recently diagnosed adolescent and young adult cancer patients. Cancer. 2013;119:201‐214. - PubMed