Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 24:16:27.
doi: 10.4103/ijpvm.ijpvm_306_23. eCollection 2025.

Machine Learning Helps in Prediction of Tobacco Smoking in Adolescents

Affiliations

Machine Learning Helps in Prediction of Tobacco Smoking in Adolescents

Hamidreza Roohafza et al. Int J Prev Med. .

Abstract

Background: Considering the increasing prevalence of adolescent smoking in recent years, this study proposes a machine learning (ML) approach for distinguishing adolescents who are prone to start smoking and those who do not directly confess to smoking.

Methods: We used two repeated measures cross-sectional studies, including data from 7940 individuals as distinct training and test datasets. Utilizing the randomized least absolute shrinkage and selector operator (LASSO), the most influential factors were selected. We then investigated the performance of different ML approaches for the automatic classification of students into smoker/nonsmoker and low-risk/high-risk categories.

Results: Randomized LASSO feature selection prioritized 15 factors, including peer influence, risky behaviors, attitude and school policy toward smoking, family factors, depression, and sex as the most influential factors in smoking. Applying different ML approaches to the three study plans yielded an AUC of up to 0.92, sensitivity of up to 0.88, PPV of up to 0.72, specificity of up to 0.98, and NPV of up to 0.99.

Conclusions: The results showed the capability of our ML approach to distinguish between classes of smokers and nonsmokers. This model can be used as a brief screening tool for automated prediction of individuals susceptible to smoking for more precise preventive intervention plans focusing on adolescents.

Keywords: Adolescent; classification; machine learning; prediction; tobacco.

PubMed Disclaimer

Conflict of interest statement

There are no conflicts of interest.

Figures

Figure 1
Figure 1
Description of the three output definition (OD)
Figure 2
Figure 2
Flowchart of the proposed method. The utilized classifier are Logistic Regression Classifier (LRC), Support Vector Machine (SVM), Random Forest Classifier (RFC), Gradient Boosting classifier (GBC), and AdaBoost Classifier (ADB). The performance evaluation metrics are: Area Under Curve (AUC), Sensitivity, Specificity, Positive Predictive Value (PPV) and Negative Predictive Value (NPV)
Figure 3
Figure 3
Average of the answers to the 15 selected variables, in training and test data sets. PI, Peer Influences; SP, School Policies toward smoking; A, Attitude toward smoking; F, Family factor; RB, Risky Behaviors; D, Depression; and S, Sex. Figures (I) and (II) are related to the study population of 2010 (Training samples) and, Figures (III) and (IV) are related to the study population of 2015 (Test samples)

Similar articles

References

    1. WHO Report on the Global Tobacco Epidemic 2019: Offer Help to quit tobacco use World Health Organization. 2019
    1. Barnes J, McRobbie H, Dong CY, Walker N, Hartmann-Boyce J. Hypnotherapy for smoking cessation. Cochrane Database Syst Revi. 2019;2019 CD001008. - PMC - PubMed
    1. Perez-Warnisher MT, de Miguel MdPC, Seijo LM. Tobacco use worldwide: Legislative efforts to curb consumption. Ann Glob Health. 2018;84:571. 9. - PMC - PubMed
    1. Warren CW, Riley L, Asma S, Eriksen MP, Green L, Blanton C, et al. Tobacco use by youth: A surveillance report from the Global Youth Tobacco Survey project. Bull World Health Organ. 2000;78:868–76. - PMC - PubMed
    1. Duncan LR, Pearson ES, Maddison R. Smoking prevention in children and adolescents: A systematic review of individualized interventions. Patient Educ Couns. 2018;101:375–88. - PubMed

LinkOut - more resources