Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 24;11(5):765.
doi: 10.3390/diagnostics11050765.

Intelligent Bone Age Assessment: An Automated System to Detect a Bone Growth Problem Using Convolutional Neural Networks with Attention Mechanism

Affiliations

Intelligent Bone Age Assessment: An Automated System to Detect a Bone Growth Problem Using Convolutional Neural Networks with Attention Mechanism

Mohd Asyraf Zulkifley et al. Diagnostics (Basel). .

Abstract

Skeletal bone age assessment using X-ray images is a standard clinical procedure to detect any anomaly in bone growth among kids and babies. The assessed bone age indicates the actual level of growth, whereby a large discrepancy between the assessed and chronological age might point to a growth disorder. Hence, skeletal bone age assessment is used to screen the possibility of growth abnormalities, genetic problems, and endocrine disorders. Usually, the manual screening is assessed through X-ray images of the non-dominant hand using the Greulich-Pyle (GP) or Tanner-Whitehouse (TW) approach. The GP uses a standard hand atlas, which will be the reference point to predict the bone age of a patient, while the TW uses a scoring mechanism to assess the bone age using several regions of interest information. However, both approaches are heavily dependent on individual domain knowledge and expertise, which is prone to high bias in inter and intra-observer results. Hence, an automated bone age assessment system, which is referred to as Attention-Xception Network (AXNet) is proposed to automatically predict the bone age accurately. The proposed AXNet consists of two parts, which are image normalization and bone age regression modules. The image normalization module will transform each X-ray image into a standardized form so that the regressor network can be trained using better input images. This module will first extract the hand region from the background, which is then rotated to an upright position using the angle calculated from the four key-points of interest. Then, the masked and rotated hand image will be aligned such that it will be positioned in the middle of the image. Both of the masked and rotated images will be obtained through existing state-of-the-art deep learning methods. The last module will then predict the bone age through the Attention-Xception network that incorporates multiple layers of spatial-attention mechanism to emphasize the important features for more accurate bone age prediction. From the experimental results, the proposed AXNet achieves the lowest mean absolute error and mean squared error of 7.699 months and 108.869 months2, respectively. Therefore, the proposed AXNet has demonstrated its potential for practical clinical use with an error of less than one year to assist the experts or radiologists in evaluating the bone age objectively.

Keywords: X-ray image; attention mechanism; bone growth disorder; convolutional neural network; regression network.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
An example of bone changes between an 8-month-old baby and a 5-year-old toddler.
Figure 2
Figure 2
The ROIs of a hand X-ray image for the TW approach. The red boxes refer to the phalangeal joints, the yellow box refers to the carpal region, the blue box refers to the radius physis region, and the green box refers to the ulna physis region.
Figure 3
Figure 3
The workflow of the proposed automated assessment of the bone age.
Figure 4
Figure 4
Key-points alignment that rotates the segmented hand region such that the tip of the middle finger and the lower end of the carpal region form a straight vertical line.
Figure 5
Figure 5
The full architecture of the proposed Attention-Xception Network.
Figure 6
Figure 6
The architecture of a three-layer residual skip connection unit.
Figure 7
Figure 7
The full architecture of the attention mechanism in AXNet.
Figure 8
Figure 8
Some samples of the X-ray image that have been normalized by the three normalization modules.

Similar articles

Cited by

References

    1. Iglovikov V.I., Rakhlin A., Kalinin A.A., Shvets A.A. Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support. Springer; Cham, Switzerland: 2018. Paediatric bone age assessment using deep convolutional neural networks; pp. 300–308.
    1. Ren X., Li T., Yang X., Wang S., Ahmad S., Xiang L., Stone S.R., Li L., Zhan Y., Shen D., et al. Regression convolutional neural network for automated pediatric bone age assessment from hand radiograph. IEEE J. Biomed. Health Inform. 2018;23:2030–2038. doi: 10.1109/JBHI.2018.2876916. - DOI - PubMed
    1. Mutasa S., Chang P.D., Ruzal-Shapiro C., Ayyala R. MABAL: A novel deep-learning architecture for machine-assisted bone age labeling. J. Digit. Imaging. 2018;31:513–519. doi: 10.1007/s10278-018-0053-3. - DOI - PMC - PubMed
    1. Hao P., Chen Y., Chokuwa S., Wu F., Bai C. Pacific Rim Conference on Multimedia. Springer; Cham, Switherland: 2018. Skeletal bone age assessment based on deep convolutional neural networks; pp. 408–417.
    1. Liu Y., Zhang C., Cheng J., Chen X., Wang Z.J. A multi-scale data fusion framework for bone age assessment with convolutional neural networks. Comput. Biol. Med. 2019;108:161–173. doi: 10.1016/j.compbiomed.2019.03.015. - DOI - PubMed

LinkOut - more resources