Targeted maximum likelihood based causal inference: Part I

Mark J van der Laan¹

Affiliations

PMID: 21969992
PMCID: PMC3126670
DOI: 10.2202/1557-4679.1211

Targeted maximum likelihood based causal inference: Part I

Mark J van der Laan. Int J Biostat. 2010.

. 2010;6(2):Article 2.

doi: 10.2202/1557-4679.1211.

Author

Mark J van der Laan¹

Affiliation

¹ University of California - Berkeley, CA, USA.

PMID: 21969992
PMCID: PMC3126670
DOI: 10.2202/1557-4679.1211

Abstract

Given causal graph assumptions, intervention-specific counterfactual distributions of the data can be defined by the so called G-computation formula, which is obtained by carrying out these interventions on the likelihood of the data factorized according to the causal graph. The obtained G-computation formula represents the counterfactual distribution the data would have had if this intervention would have been enforced on the system generating the data. A causal effect of interest can now be defined as some difference between these counterfactual distributions indexed by different interventions. For example, the interventions can represent static treatment regimens or individualized treatment rules that assign treatment in response to time-dependent covariates, and the causal effects could be defined in terms of features of the mean of the treatment-regimen specific counterfactual outcome of interest as a function of the corresponding treatment regimens. Such features could be defined nonparametrically in terms of so called (nonparametric) marginal structural models for static or individualized treatment rules, whose parameters can be thought of as (smooth) summary measures of differences between the treatment regimen specific counterfactual distributions. In this article, we develop a particular targeted maximum likelihood estimator of causal effects of multiple time point interventions. This involves the use of loss-based super-learning to obtain an initial estimate of the unknown factors of the G-computation formula, and subsequently, applying a target-parameter specific optimal fluctuation function (least favorable parametric submodel) to each estimated factor, estimating the fluctuation parameter(s) with maximum likelihood estimation, and iterating this updating step of the initial factor till convergence. This iterative targeted maximum likelihood updating step makes the resulting estimator of the causal effect double robust in the sense that it is consistent if either the initial estimator is consistent, or the estimator of the optimal fluctuation function is consistent. The optimal fluctuation function is correctly specified if the conditional distributions of the nodes in the causal graph one intervenes upon are correctly specified. The latter conditional distributions often comprise the so called treatment and censoring mechanism. Selection among different targeted maximum likelihood estimators (e.g., indexed by different initial estimators) can be based on loss-based cross-validation such as likelihood based cross-validation or cross-validation based on another appropriate loss function for the distribution of the data. Some specific loss functions are mentioned in this article. Subsequently, a variety of interesting observations about this targeted maximum likelihood estimation procedure are made. This article provides the basis for the subsequent companion Part II-article in which concrete demonstrations for the implementation of the targeted MLE in complex causal effect estimation problems are provided.

PubMed Disclaimer

Cited by

Comparing results from multiple imputation and dynamic marginal structural models for estimating when to start antiretroviral therapy.
Shepherd BE, Liu Q, Mercaldo N, Jenkins CA, Lau B, Cole SR, Saag MS, Sterling TR. Shepherd BE, et al. Stat Med. 2016 Oct 30;35(24):4335-4351. doi: 10.1002/sim.7007. Epub 2016 Jun 6. Stat Med. 2016. PMID: 27264354 Free PMC article.
Long-Term Associations between Disaster-Related Home Loss and Health and Well-Being of Older Survivors: Nine Years after the 2011 Great East Japan Earthquake and Tsunami.
Shiba K, Hikichi H, Okuzono SS, VanderWeele TJ, Arcaya M, Daoud A, Cowden RG, Yazawa A, Zhu DT, Aida J, Kondo K, Kawachi I. Shiba K, et al. Environ Health Perspect. 2022 Jul;130(7):77001. doi: 10.1289/EHP10903. Epub 2022 Jul 1. Environ Health Perspect. 2022. PMID: 35776697 Free PMC article.
Robust estimation of encouragement-design intervention effects transported across sites.
Rudolph KE, van der Laan MJ. Rudolph KE, et al. J R Stat Soc Series B Stat Methodol. 2017 Nov;79(5):1509-1525. doi: 10.1111/rssb.12213. Epub 2016 Oct 31. J R Stat Soc Series B Stat Methodol. 2017. PMID: 29375249 Free PMC article.
A calibration approach to transportability and data-fusion with observational data.
Josey KP, Yang F, Ghosh D, Raghavan S. Josey KP, et al. Stat Med. 2022 Oct 15;41(23):4511-4531. doi: 10.1002/sim.9523. Epub 2022 Jul 18. Stat Med. 2022. PMID: 35848098 Free PMC article.
Dynamic multi-outcome prediction after injury: Applying adaptive machine learning for precision medicine in trauma.
Christie SA, Conroy AS, Callcut RA, Hubbard AE, Cohen MJ. Christie SA, et al. PLoS One. 2019 Apr 10;14(4):e0213836. doi: 10.1371/journal.pone.0213836. eCollection 2019. PLoS One. 2019. PMID: 30970030 Free PMC article.

See all "Cited by" articles

References

1. Abadie A, Imbens GW. Large sample properties of matching estimators for average treatment effects. Econometrica. 2006;74:235–67. doi: 10.1111/j.1468-0262.2006.00655.x. - DOI
1. Andersen PK, Borgan O, Gill RD, Keiding N. Statistical Models Based on Counting Processes. Springer-Verlag; New York: 1993.
1. Bembom O, Petersen ML, Rhee S-Y, Fessel WJ, Sinisi SE, Shafer RW, van der Laan MJ. Biomarker discovery using targeted maximum likelihood estimation: Application to the treatment of antiretroviral resistant hiv infection. Statistics in Medicine. page http://www3.interscience.wiley.com/journal/121422393/abstract, 2008. - PMC - PubMed
1. Bembom O, van der Laan MJ, Haight T, Tager IB. Lifetime and current leisure time physical activity and all-cause mortality in an elderly cohort. Epidemiology. 2009 doi: 10.1097/EDE.0b013e31819e3f28. - DOI - PMC - PubMed
1. Bembom Oliver, van der Laan Mark. Statistical methods for analyzing sequentially randomized trials, commentary on jnci article adaptive therapy for androgen independent prostate cancer: A randomized selection trial including four regimens, by peter f. thall, c. logothetis, c. pagliaro, s. wen, m.a. brown, d. williams, r. millikan 2007. Journal of the National Cancer Institute. 2007;99(21):1577–1582. doi: 10.1093/jnci/djm185. - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

R01 AI074345/AI/NIAID NIH HHS/United States

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Targeted maximum likelihood based causal inference: Part I

Affiliation

Targeted maximum likelihood based causal inference: Part I

Author

Affiliation

Abstract

Similar articles

Cited by

References

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Abstract

Similar articles

Cited by

References

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources