Using machine learning to extract information and predict outcomes from reports of randomised trials of smoking cessation interventions in the Human Behaviour-Change Project
- PMID: 38779058
- PMCID: PMC11109593
- DOI: 10.12688/wellcomeopenres.20000.2
Using machine learning to extract information and predict outcomes from reports of randomised trials of smoking cessation interventions in the Human Behaviour-Change Project
Abstract
Background: Using reports of randomised trials of smoking cessation interventions as a test case, this study aimed to develop and evaluate machine learning (ML) algorithms for extracting information from study reports and predicting outcomes as part of the Human Behaviour-Change Project. It is the first of two linked papers, with the second paper reporting on further development of a prediction system.
Methods: Researchers manually annotated 70 items of information ('entities') in 512 reports of randomised trials of smoking cessation interventions covering intervention content and delivery, population, setting, outcome and study methodology using the Behaviour Change Intervention Ontology. These entities were used to train ML algorithms to extract the information automatically. The information extraction ML algorithm involved a named-entity recognition system using the 'FLAIR' framework. The manually annotated intervention, population, setting and study entities were used to develop a deep-learning algorithm using multiple layers of long-short-term-memory (LSTM) components to predict smoking cessation outcomes.
Results: The F1 evaluation score, derived from the false positive and false negative rates (range 0-1), for the information extraction algorithm averaged 0.42 across different types of entity (SD=0.22, range 0.05-0.88) compared with an average human annotator's score of 0.75 (SD=0.15, range 0.38-1.00). The algorithm for assigning entities to study arms ( e.g., intervention or control) was not successful. This initial ML outcome prediction algorithm did not outperform prediction based just on the mean outcome value or a linear regression model.
Conclusions: While some success was achieved in using ML to extract information from reports of randomised trials of smoking cessation interventions, we identified major challenges that could be addressed by greater standardisation in the way that studies are reported. Outcome prediction from smoking cessation studies may benefit from development of novel algorithms, e.g., using ontological information to inform ML (as reported in the linked paper 1).
Keywords: artificial intelligence; behaviour change interventions; evidence synthesis; information extractions; machine learning; natural language processing; ontologies; prediction systems.
Copyright: © 2024 West R et al.
Conflict of interest statement
Competing interests: RW and SM are unpaid directors of the Unlocking Behaviour Change Community Interest Company.
Figures
Similar articles
-
Interventions to reduce harm from continued tobacco use.Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3. Cochrane Database Syst Rev. 2016. PMID: 27734465 Free PMC article.
-
Effectiveness and cost-effectiveness of computer and other electronic aids for smoking cessation: a systematic review and network meta-analysis.Health Technol Assess. 2012;16(38):1-205, iii-v. doi: 10.3310/hta16380. Health Technol Assess. 2012. PMID: 23046909
-
Interventions to increase adherence to medications for tobacco dependence.Cochrane Database Syst Rev. 2015 Feb 23;(2):CD009164. doi: 10.1002/14651858.CD009164.pub2. Cochrane Database Syst Rev. 2015. Update in: Cochrane Database Syst Rev. 2019 Aug 16;8:CD009164. doi: 10.1002/14651858.CD009164.pub3. PMID: 25914910 Updated.
-
Pharmacological and electronic cigarette interventions for smoking cessation in adults: component network meta-analyses.Cochrane Database Syst Rev. 2023 Sep 12;9(9):CD015226. doi: 10.1002/14651858.CD015226.pub2. Cochrane Database Syst Rev. 2023. PMID: 37696529 Free PMC article.
-
Smoking cessation medicines and e-cigarettes: a systematic review, network meta-analysis and cost-effectiveness analysis.Health Technol Assess. 2021 Oct;25(59):1-224. doi: 10.3310/hta25590. Health Technol Assess. 2021. PMID: 34668482
Cited by
-
A data extraction template for the behaviour change intervention ontology.Wellcome Open Res. 2024 Mar 26;9:168. doi: 10.12688/wellcomeopenres.20872.1. eCollection 2024. Wellcome Open Res. 2024. PMID: 38873399 Free PMC article.
-
The Behaviour Change Technique Ontology: Transforming the Behaviour Change Technique Taxonomy v1.Wellcome Open Res. 2024 May 9;8:308. doi: 10.12688/wellcomeopenres.19363.2. eCollection 2023. Wellcome Open Res. 2024. PMID: 37593567 Free PMC article.
-
From smoking cessation to physical activity: Can ontology-based methods for automated evidence synthesis generalise across behaviour change domains?Wellcome Open Res. 2025 Mar 24;9:402. doi: 10.12688/wellcomeopenres.21664.2. eCollection 2024. Wellcome Open Res. 2025. PMID: 40225272 Free PMC article.
-
The BSSO Foundry: A community of practice for ontologies in the behavioural and social sciences.Wellcome Open Res. 2024 Nov 7;9:656. doi: 10.12688/wellcomeopenres.23230.1. eCollection 2024. Wellcome Open Res. 2024. PMID: 39664869 Free PMC article.
References
-
- West R, Michie S: How many papers are published each week reporting on trials of interventions involving behavioural aspects of health?Qeios. 10.32388/U6VX2Z - DOI
-
- Hastings J, Glauer M, West R, et al. : Predicting outcomes of smoking cessation interventions in novel scenarios using ontology-informed, interpretable machine learning [version 1; peer review: 1 approved, 1 approved with reservations]. Wellcome Open Res. 2023;8:503. 10.12688/wellcomeopenres.20012.1 - DOI
-
- Gough D, Oliver S, Thomas J: An introduction to systematic reviews.SAGE;2017;353. Reference Source
Grants and funding
LinkOut - more resources
Full Text Sources