Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Oct;11(13):1469-1486.
doi: 10.2217/epi-2019-0206. Epub 2019 Aug 30.

EpiSmokEr: a robust classifier to determine smoking status from DNA methylation data

Affiliations

EpiSmokEr: a robust classifier to determine smoking status from DNA methylation data

Sailalitha Bollepalli et al. Epigenomics. 2019 Oct.

Abstract

Aim: Smoking strongly influences DNA methylation, with current and never smokers exhibiting different methylation profiles. Methods: To advance the practical applicability of the smoking-associated methylation signals, we used machine learning methodology to train a classifier for smoking status prediction. Results: We show the prediction performance of our classifier on three independent whole-blood datasets demonstrating its robustness and global applicability. Furthermore, we examine the reasons for biologically meaningful misclassifications through comprehensive phenotypic evaluation. Conclusion: The major contribution of our classifier is its global applicability without a need for users to determine a threshold value for each dataset to predict the smoking status. We provide an R package, EpiSmokEr (Epigenetic Smoking status Estimator), facilitating the use of our classifier to predict smoking status in future studies.

Keywords: DNA methylation; epigenetic smoking status; multinomial LASSO; smoking status classifier; tobacco smoking.

PubMed Disclaimer

Publication types

LinkOut - more resources