Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 16;12(1):6388.
doi: 10.1038/s41598-022-10292-y.

Accuracy and self-validation of automated bone age determination

Affiliations

Accuracy and self-validation of automated bone age determination

D D Martin et al. Sci Rep. .

Abstract

The BoneXpert method for automated determination of bone age from hand X-rays was introduced in 2009 and is currently running in over 200 hospitals. The aim of this work is to present version 3 of the method and validate its accuracy and self-validation mechanism that automatically rejects an image if it is at risk of being analysed incorrectly. The training set included 14,036 images from the 2017 Radiological Society of North America (RSNA) Bone Age Challenge, 1642 images of normal Dutch and Californian children, and 8250 images from Tübingen from patients with Short Stature, Congenital Adrenal Hyperplasia and Precocious Puberty. The study resulted in a cross-validated root mean square (RMS) error in the Tübingen images of 0.62 y, compared to 0.72 y in the previous version. The RMS error on the RSNA test set of 200 images was 0.45 y relative to the average of six manual ratings. The self-validation mechanism rejected 0.4% of the RSNA images. 121 outliers among the self-validated images of the Tübingen study were rerated, resulting in 6 cases where BoneXpert deviated more than 1.5 years from the average of the three re-ratings, compared to 72 such cases for the original manual ratings. The accuracy of BoneXpert is clearly better than the accuracy of a single manual rating. The self-validation mechanism rejected very few images, typically with abnormal anatomy, and among the accepted images, there were 12 times fewer severe bone age errors than in manual ratings, suggesting that BoneXpert could be safer than manual rating.

PubMed Disclaimer

Conflict of interest statement

H.H.T. is the owner of Visiana, which develops and markets the BoneXpert medical device for bone age assessment. The other authors declare no relationships with any companies, whose products or services may be related to the subject matter of the article.

Figures

Figure 1
Figure 1
Flowchart of the software architecture. The architecture is dominated by the six steps of the selfvalidation. The first two steps validate individual bone contributions, while the last four validate the entire image.
Figure 2
Figure 2
Posterior-anterior left-hand radiograph with bone age assessment by BoneXpert. This is the result file automatically placed in the same study in the digital archive. BA = bone age, averaged over the 21 tubular bones; BHI = Bone Health Index; SDS = standard deviation score; GP = Greulich-Pyle; TW3 = Tanner-Whitehouse Version 3.
Figure 3
Figure 3
The agreement between automated and manual rating in the three Tübingen studies of children with Short Stature, Congenital Adrenal Hyperplasia or Precocious Puberty. The manual ratings are the original manual ratings without any corrections. The lines at ± 1.8 years define the disputed cases to be rerated 3 times. On the horizontal axis is the average of manual and automated bone age rating.
Figure 4
Figure 4
Bland Altmann plot for the agreement between the automated rating and the manual rating for the 200 cases in the RSNA test set.
Figure 5
Figure 5
Bland Altman plot of the agreement between automated and manual bone age rating below 2 y.
Figure 6
Figure 6
Females 2707 and 3781 from the RSNA study. The automated method accepted only 9 and 10 of the 21 tubular bones, which was just above the 8 bones needed for the entire analysis to succeed. In the first image, the overlap between radius and lunate is likely to have caused radius to be rejected; and the pose of the middle and distal phalanges in finger 2–4 is incorrect due to flexion. In the second image, there is brachydactyly in the second and fifth middle phalanges, and the second and third proximal phalanges have thick epiphyses, suggesting the monogenic disorder Brachydactyly Type C. For clarity, the bone contours are drawn in alternating colours. The numbers indicate the GP bone ages.
Figure 7
Figure 7
Two females, 4210 and 13,824, from the RSNA study with 10 and 9 accepted tubular bones. Ray 2 is missing in the first image, which causes the Bone Health Index analysis to fail (indicated by the code 3). The radial epiphysis is abnormally thickened and there is pronounced positive ulnar variance. The first digit is likely to be a pollicised index finger following surgery for thumb aplasia or severe hypoplasia; a likely case of radial longitudinal reduction defect. The second image shows brachydactyly in the middle phalanges. Finger 1 has two epiphyses in the metacarpal, and either a thick epiphysis in the distal phalanx or an extra short phalanx; the thumb is somewhat ‘finger-like’. The second and fifth digits similarly appear biphalangeal with large epiphyses of the distal phalanges. Digits 3 and 4 show middle phalangeal hypoplasia. Appearances suggest an underlying genetic disorder of hand-development requiring further evaluation.

References

    1. Greulich WW, Pyle SI. Radiographic atlas of skeletal development of the hand and wrist. 2. California: Stanford University Press; 1959.
    1. Martin DD, et al. The use of bone age in clinical practice - part 1. Horm. Res. Paediatr. 2011;76:1–9. doi: 10.1159/000329372. - DOI - PubMed
    1. Martin DD, et al. The use of bone age in clinical practice - part 2. Horm. Res. Paediatr. 2011;76:10–16. doi: 10.1159/000329374. - DOI - PubMed
    1. Thodberg HH, Kreiborg S, Juul A, Pedersen KD. The BoneXpert method for automated determination of skeletal maturity. IEEE Trans. Med. Imaging. 2009;28:52–66. doi: 10.1109/TMI.2008.926067. - DOI - PubMed
    1. Kaplowitz P, et al. Comparison of bone age readings by pediatric endocrinologists and pediatric radiologists using two bone age atlases. Pediatr. Radiol. 2010;41:690–693. doi: 10.1007/s00247-010-1915-0. - DOI - PubMed

LinkOut - more resources