Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 27;12(13):4297.
doi: 10.3390/jcm12134297.

Neural Network Approach to Investigating the Importance of Test Items for Predicting Physical Activity in Chronic Obstructive Pulmonary Disease

Affiliations

Neural Network Approach to Investigating the Importance of Test Items for Predicting Physical Activity in Chronic Obstructive Pulmonary Disease

Yoshiki Nakahara et al. J Clin Med. .

Abstract

Contracting COPD reduces a patient's physical activity and restricts everyday activities (physical activity disorder). However, the fundamental cause of physical activity disorder has not been found. In addition, costly and specialized equipment is required to accurately examine the disorder; hence, it is not regularly assessed in normal clinical practice. In this study, we constructed a machine learning model to predict physical activity using test items collected during the normal care of COPD patients. In detail, we first applied three types of data preprocessing methods (zero-padding, multiple imputation by chained equations (MICE), and k-nearest neighbor (kNN)) to complement missing values in the dataset. Then, we constructed several types of neural networks to predict physical activity. Finally, permutation importance was calculated to identify the importance of the test items for prediction. Multifactorial analysis using machine learning, including blood, lung function, walking, and chest imaging tests, was the unique point of this research. From the experimental results, it was found that the missing value processing using MICE contributed to the best prediction accuracy (73.00%) compared to that using zero-padding (68.44%) or kNN (71.52%), and showed better accuracy than XGBoost (66.12%) with a significant difference (p < 0.05). For patients with severe physical activity reduction (total exercise < 1.5), a high sensitivity (89.36%) was obtained. The permutation importance showed that "sex, the number of cigarettes, age, and the whole body phase angle (nutritional status)" were the most important items for this prediction. Furthermore, we found that a smaller number of test items could be used in ordinary clinical practice for the screening of physical activity disorder.

Keywords: COPD; autoencoder; neural network; physical activity; prediction.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The model structure of the 3-layer autoencoder used for pre-training. There were 34 units in the input layer. In the hidden layer, a feature vector (feature values) that encoded the inputs was obtained. There were 34 units in the output layer, in which the inputs were reconstructed from the hidden layer vectors. The number of units in the hidden layer was 20, i.e., the feature values were represented by 20-dimensional vectors.
Figure 2
Figure 2
The model structure used for the fine-tuning, which was constructed after pre-training with the 3-layer autoencoder. The output layer in Figure 1 was removed and the model was connected to a new output layer to predict T-Ex1 and T-Ex2.
Figure 3
Figure 3
The model structure of the 4-layer autoencoder used for pre-training. The input layer had 34 units. In the two hidden layers, the vectors (feature values) that encoded the inputs were obtained. The output layer had 34 units and the inputs were reconstructed from the hidden layer vectors.
Figure 4
Figure 4
The model structure used for fine-tuning, which was constructed after pre-training with the 4-layer autoencoder. The output layer in Figure 3 was removed and the model was connected to a new output layer to predict T-Ex1 and T-Ex2.
Figure 5
Figure 5
The importance of each test item. The horizontal axis shows the test items and the vertical axis shows their importance. The test items in order of effect on prediction performance were sex, the number of cigarettes per day, age, whole body phase angle, DDR, mMRC, FEV1, %RV/TLC, pack years, and BNP.

Similar articles

Cited by

References

    1. Cooper C.B. Airflow obstruction and exercise. Respir. Med. 2009;103:325–334. doi: 10.1016/j.rmed.2008.10.026. - DOI - PubMed
    1. Waschki B., Kirsten A., Holz O., Müller K.C., Meyer T., Watz H., Magnussen H. Physical Activity Is the Strongest Predictor of All-Cause Mortality in Patients With COPD: A Prospective Cohort Study. Chest. 2011;140:331–342. doi: 10.1378/chest.10-2521. - DOI - PubMed
    1. Troosters T., van der Molen T., Polkey M., Rabinovich R.A., Vogiatzis I., Weisman I., Kulich K. Improving physical activity in COPD: Towards a new paradigm. Respir. Res. 2013;14:115. doi: 10.1186/1465-9921-14-115. - DOI - PMC - PubMed
    1. Hirano T., Doi K., Matsunaga K., Takahashi S., Donishi T., Suga K., Oishi K., Yasuda K., Mimura Y., Harada M., et al. A Novel Role of Growth Differentiation Factor (GDF)-15 in Overlap with Sedentary Lifestyle and Cognitive Risk in COPD. J. Clin. Med. 2020;9:2737. doi: 10.3390/jcm9092737. - DOI - PMC - PubMed
    1. Matsunaga K., Harada M., Suizu J., Oishi K., Asami-Noyama M., Hirano T. Comorbid Conditions in Chronic Obstructive Pulmonary Disease: Potential Therapeutic Targets for Unmet Needs. J. Clin. Med. 2020;9:3078. doi: 10.3390/jcm9103078. - DOI - PMC - PubMed

LinkOut - more resources