Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov 20:8:452.
doi: 10.12688/wellcomeopenres.20000.2. eCollection 2023.

Using machine learning to extract information and predict outcomes from reports of randomised trials of smoking cessation interventions in the Human Behaviour-Change Project

Affiliations

Using machine learning to extract information and predict outcomes from reports of randomised trials of smoking cessation interventions in the Human Behaviour-Change Project

Robert West et al. Wellcome Open Res. .

Abstract

Background: Using reports of randomised trials of smoking cessation interventions as a test case, this study aimed to develop and evaluate machine learning (ML) algorithms for extracting information from study reports and predicting outcomes as part of the Human Behaviour-Change Project. It is the first of two linked papers, with the second paper reporting on further development of a prediction system.

Methods: Researchers manually annotated 70 items of information ('entities') in 512 reports of randomised trials of smoking cessation interventions covering intervention content and delivery, population, setting, outcome and study methodology using the Behaviour Change Intervention Ontology. These entities were used to train ML algorithms to extract the information automatically. The information extraction ML algorithm involved a named-entity recognition system using the 'FLAIR' framework. The manually annotated intervention, population, setting and study entities were used to develop a deep-learning algorithm using multiple layers of long-short-term-memory (LSTM) components to predict smoking cessation outcomes.

Results: The F1 evaluation score, derived from the false positive and false negative rates (range 0-1), for the information extraction algorithm averaged 0.42 across different types of entity (SD=0.22, range 0.05-0.88) compared with an average human annotator's score of 0.75 (SD=0.15, range 0.38-1.00). The algorithm for assigning entities to study arms ( e.g., intervention or control) was not successful. This initial ML outcome prediction algorithm did not outperform prediction based just on the mean outcome value or a linear regression model.

Conclusions: While some success was achieved in using ML to extract information from reports of randomised trials of smoking cessation interventions, we identified major challenges that could be addressed by greater standardisation in the way that studies are reported. Outcome prediction from smoking cessation studies may benefit from development of novel algorithms, e.g., using ontological information to inform ML (as reported in the linked paper 1).

Keywords: artificial intelligence; behaviour change interventions; evidence synthesis; information extractions; machine learning; natural language processing; ontologies; prediction systems.

PubMed Disclaimer

Conflict of interest statement

Competing interests: RW and SM are unpaid directors of the Unlocking Behaviour Change Community Interest Company.

Figures

Figure 1.
Figure 1.. Overview of the Human Behaviour Change Project knowledge system.
Figure 2.
Figure 2.. Overview of automated information extraction pipeline.
Figure 3.
Figure 3.. Overview of the outcome prediction ML algorithm.

Similar articles

Cited by

References

    1. Michie S, Thomas J, Johnston M, et al. : The Human Behaviour-Change Project: harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation. Implement Sci. 2017;12(1): 121. 10.1186/s13012-017-0641-5 - DOI - PMC - PubMed
    1. West R, Michie S: How many papers are published each week reporting on trials of interventions involving behavioural aspects of health?Qeios. 10.32388/U6VX2Z - DOI
    1. Hastings J, Glauer M, West R, et al. : Predicting outcomes of smoking cessation interventions in novel scenarios using ontology-informed, interpretable machine learning [version 1; peer review: 1 approved, 1 approved with reservations]. Wellcome Open Res. 2023;8:503. 10.12688/wellcomeopenres.20012.1 - DOI
    1. Gough D, Oliver S, Thomas J: An introduction to systematic reviews.SAGE;2017;353. Reference Source
    1. Allen IE, Olkin I: Estimating time to conduct a meta-analysis from number of citations retrieved. JAMA. 1999;282(7):634–5. 10.1001/jama.282.7.634 - DOI - PubMed

LinkOut - more resources