Targeted maximum likelihood estimation for a binary treatment: A tutorial

Miguel Angel Luque-Fernandez¹ ² ³, Michael Schomaker⁴, Bernard Rachet¹, Mireille E Schnitzer⁵

Affiliations

¹ Cancer Survival Group, Department of Non-Communicable Disease Epidemiology, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.
² Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
³ Biomedical Research Institute of Granada, Non-Communicable and Cancer Epidemiology Group (ibs.Granada), Andalusian School of Public Health, Granada, Spain.
⁴ School of Public Health and Family Medicine, Center for Infectious Disease Epidemiology and Research, The University of Cape Town, Cape Town, South Africa.
⁵ Faculté de pharmacie, Université de Montréal, Montréal, Canada.

PMID: 29687470
PMCID: PMC6032875
DOI: 10.1002/sim.7628

Miguel Angel Luque-Fernandez et al. Stat Med. 2018 Jul 20;37(16):2530-2546. doi: 10.1002/sim.7628. Epub 2018 Apr 23.

Abstract

When estimating the average effect of a binary treatment (or exposure) on an outcome, methods that incorporate propensity scores, the G-formula, or targeted maximum likelihood estimation (TMLE) are preferred over naïve regression approaches, which are biased under misspecification of a parametric outcome model. In contrast, propensity score methods require the correct specification of an exposure model. Double-robust methods only require correct specification of either the outcome or the exposure model. Targeted maximum likelihood estimation is a semiparametric double-robust method that improves the chances of correct model specification by allowing for flexible estimation using (nonparametric) machine-learning methods. It therefore requires weaker assumptions than its competitors. We provide a step-by-step guided implementation of TMLE and illustrate it in a realistic scenario based on cancer epidemiology where the assumptions of correct model specification and positivity are nearly violated (positivity is violated when a study participant has 0 probability of receiving the treatment). This article provides a concise and reproducible educational introduction to TMLE for a binary outcome and exposure. The reader should gain sufficient understanding of TMLE from this introductory tutorial to be able to apply the method in practice. Extensive R code is provided in easy-to-read boxes throughout the article for replicability. Stata users will find a testing implementation of TMLE and additional material in Appendix S1 and at the following GitHub repository: https://github.com/migariane/SIM-TMLE-tutorial.

Keywords: causal inference; ensemble learning; machine learning; observational studies; targeted maximum likelihood estimation.
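
The estimation steps that the abstract summarises (an initial outcome model, an exposure model, and a targeting step that combines them) can be sketched as follows for a binary treatment and binary outcome. This is a minimal illustration under stated assumptions, not the article's R implementation: plain logistic regressions stand in for the machine-learning ensemble the abstract mentions, the propensity-score bound of 0.025 is an arbitrary choice, and the names `fit_logistic`, `tmle_ate`, and `simulate` (with its made-up data-generating mechanism) are hypothetical.

```python
import numpy as np


def expit(x):
    return 1.0 / (1.0 + np.exp(-x))


def logit(p):
    return np.log(p / (1.0 - p))


def fit_logistic(X, y, offset=None, n_iter=50):
    """Unpenalised logistic regression fitted by Newton-Raphson."""
    offset = np.zeros(len(y)) if offset is None else offset
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = expit(X @ beta + offset)
        grad = X.T @ (y - p)
        hess = X.T @ (X * (p * (1.0 - p))[:, None])
        step = np.linalg.solve(hess, grad)
        beta = beta + step
        if np.max(np.abs(step)) < 1e-10:
            break
    return beta


def tmle_ate(W, A, Y, bound=0.025):
    """TMLE of the average treatment effect (risk difference)."""
    n = len(Y)
    ones = np.ones(n)
    # Step 1: initial outcome model Q(A, W) = E(Y | A, W).
    bq = fit_logistic(np.column_stack([ones, A, W]), Y)
    Q1 = expit(np.column_stack([ones, np.ones(n), W]) @ bq)
    Q0 = expit(np.column_stack([ones, np.zeros(n), W]) @ bq)
    Q1, Q0 = np.clip(Q1, 1e-6, 1 - 1e-6), np.clip(Q0, 1e-6, 1 - 1e-6)
    QA = np.where(A == 1, Q1, Q0)
    # Step 2: exposure model g(W) = P(A = 1 | W), bounded away from 0 and 1.
    Xg = np.column_stack([ones, W])
    g1 = np.clip(expit(Xg @ fit_logistic(Xg, A)), bound, 1 - bound)
    # Step 3: clever covariate and one-parameter fluctuation of Q.
    H1, H0 = 1.0 / g1, -1.0 / (1.0 - g1)
    HA = np.where(A == 1, H1, H0)
    eps = fit_logistic(HA[:, None], Y, offset=logit(QA))[0]
    # Step 4: updated predictions and the plug-in estimate.
    Q1s = expit(logit(Q1) + eps * H1)
    Q0s = expit(logit(Q0) + eps * H0)
    ate = np.mean(Q1s - Q0s)
    # Step 5: standard error from the efficient influence curve.
    QAs = np.where(A == 1, Q1s, Q0s)
    ic = HA * (Y - QAs) + (Q1s - Q0s) - ate
    se = np.sqrt(np.var(ic, ddof=1) / n)
    return ate, se


def simulate(n, seed=0):
    """Hypothetical data: two confounders, binary treatment and outcome."""
    rng = np.random.default_rng(seed)
    W = rng.normal(size=(n, 2))
    A = rng.binomial(1, expit(0.5 * W[:, 0] - 0.5 * W[:, 1]))
    lin = -0.5 + 0.7 * W[:, 0] + 0.3 * W[:, 1]
    Y = rng.binomial(1, expit(lin + 1.0 * A))
    truth = np.mean(expit(lin + 1.0) - expit(lin))  # risk difference in this sample
    return W, A, Y, truth
```

Calling `tmle_ate(*simulate(5000)[:3])` returns the estimated risk difference and its standard error. Because both working models are correctly specified in this toy setting, the sketch does not demonstrate double robustness; swapping either `fit_logistic` call for a deliberately misspecified model is one way to explore that property.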

Figures

**Figure 1**
Directed acyclic graph. Legend: Conditional exchangeability of the treatment effect or exposure (A) on cancer mortality (Y) is obtained through conditioning on a set of available covariates W, that is, (Y(1), Y(0)) ⊥ A | W. The average treatment effect for the structural framework is estimated as the average risk difference between the expected outcome conditional on W among those treated, E(Y | A = 1, W), and the expected outcome conditional on W among those untreated, E(Y | A = 0, W). Y: binary mortality indicator (1 death, 0 alive); A: binary treatment for cancer with monotherapy versus dual therapy (1 mono, 0 dual); W: covariates (W₁: sex; W₂: age at diagnosis; W₃: cancer stage, TNM classification; W₄: comorbidities).
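
The estimand described in the Figure 1 legend can be written compactly as below. This is a sketch in the legend's notation; the symbol ψ for the average treatment effect is an assumption introduced here, and the second equality also relies on the usual consistency assumption (the observed outcome equals the potential outcome under the treatment actually received).

```latex
\begin{align}
\psi &= \mathbb{E}_W\big[\,\mathbb{E}(Y \mid A = 1, W) - \mathbb{E}(Y \mid A = 0, W)\,\big] \\
     &= \mathbb{E}[Y(1)] - \mathbb{E}[Y(0)]
     \quad \text{if } (Y(1), Y(0)) \perp A \mid W
     \text{ and } 0 < P(A = 1 \mid W) < 1 .
\end{align}
```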

**Figure 2**
Probability density function of the propensity score by treatment status for one randomly selected sample from 1000 Monte Carlo simulations.
