Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Mar 17;22(3):e17695.
doi: 10.2196/17695.

Identifying Lung Cancer Risk Factors in the Elderly Using Deep Neural Networks: Quantitative Analysis of Web-Based Survey Data

Affiliations

Identifying Lung Cancer Risk Factors in the Elderly Using Deep Neural Networks: Quantitative Analysis of Web-Based Survey Data

Songjing Chen et al. J Med Internet Res. .

Abstract

Background: Lung cancer is one of the most dangerous malignant tumors, with the fastest-growing morbidity and mortality, especially in the elderly. With a rapid growth of the elderly population in recent years, lung cancer prevention and control are increasingly of fundamental importance, but are complicated by the fact that the pathogenesis of lung cancer is a complex process involving a variety of risk factors.

Objective: This study aimed at identifying key risk factors of lung cancer incidence in the elderly and quantitatively analyzing these risk factors' degree of influence using a deep learning method.

Methods: Based on Web-based survey data, we integrated multidisciplinary risk factors, including behavioral risk factors, disease history factors, environmental factors, and demographic factors, and then preprocessed these integrated data. We trained deep neural network models in a stratified elderly population. We then extracted risk factors of lung cancer in the elderly and conducted quantitative analyses of the degree of influence using the deep neural network models.

Results: The proposed model quantitatively identified risk factors based on 235,673 adults. The proposed deep neural network models of 4 groups (age ≥65 years, women ≥65 years old, men ≥65 years old, and the whole population) achieved good performance in identifying lung cancer risk factors, with accuracy ranging from 0.927 (95% CI 0.223-0.525; P=.002) to 0.962 (95% CI 0.530-0.751; P=.002) and the area under curve ranging from 0.913 (95% CI 0.564-0.803) to 0.931(95% CI 0.499-0.593). Smoking frequency was the leading risk factor for lung cancer in men 65 years and older. Time since quitting and smoking at least 100 cigarettes in their lifetime were the main risk factors for lung cancer in women 65 years and older. Men 65 years and older had the highest lung cancer incidence among the stratified groups, particularly non-small cell lung cancer incidence. Lung cancer incidence decreased more obviously in men than in women with smoking rate decline.

Conclusions: This study demonstrated a quantitative method to identify risk factors of lung cancer in the elderly. The proposed models provided intervention indicators to prevent lung cancer, especially in older men. This approach might be used as a risk factor identification tool to apply in other cancers and help physicians make decisions on cancer prevention.

Keywords: aged; deep learning; lung cancer; primary prevention; risk factors.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: None declared.

Figures

Figure 1
Figure 1
Data selection flowchart. BRFSS: Behavioral Risk Factor Surveillance System.
Figure 2
Figure 2
Schematic diagram of lung cancer risk factor identification in the elderly. DNN: deep neural network.
Figure 3
Figure 3
Deep learning model training process. DNN: deep neural network; HDF5: hierarchical data format version 5.
Figure 4
Figure 4
Data analysis equations.
Figure 5
Figure 5
Normalized weights of risk factors in the stratified groups. BMI: body mass index; CAT: computerized axial tomography; COPD: chronic obstructive pulmonary disease; CT: computed tomography; PM2.5: fine particulate matter with a diameter ≤2.5 μm.
Figure 6
Figure 6
Relationship between smoking and lung cancer incidence, 1996-2015.

References

    1. U.S. National Library of Medicine . MedlinePlus. Non-small cell lung cancer. Bethesda, MD: U.S. Department of Health and Human Services, National Institutes of Health; 2019. [2019-06-20]. https://medlineplus.gov/ency/article/007194.htm.
    1. Schuller HM. The impact of smoking and the influence of other factors on lung cancer. Expert Rev Respir Med. 2019 Aug;13(8):761–769. doi: 10.1080/17476348.2019.1645010. - DOI - PubMed
    1. Park SK, Cho LY, Yang JJ, Park B, Chang SH, Lee K, Kim H, Yoo K, Lee C, Scientific Committee‚ Korean Academy of Tuberculosis and Respiratory Diseases Lung cancer risk and cigarette smoking, lung tuberculosis according to histologic type and gender in a population based case-control study. Lung Cancer. 2010 Apr;68(1):20–6. doi: 10.1016/j.lungcan.2009.05.017. - DOI - PubMed
    1. Hahn EJ, Hooper M, Riker C, Butler KM, Rademacher K, Wiggins A, Rayens MK. Lung cancer worry and home screening for radon and secondhand smoke in renters. J Environ Health. 2017;79(6):8–13. http://europepmc.org/abstract/MED/29135198 - PMC - PubMed
    1. Yang W, Zhao H, Wang X, Deng Q, Fan W, Wang L. An evidence-based assessment for the association between long-term exposure to outdoor air pollution and the risk of lung cancer. Eur J Cancer Prev. 2016 May;25(3):163–72. doi: 10.1097/CEJ.0000000000000158. - DOI - PubMed

Publication types