. 2018 Nov 12;13(11):e0205839.

doi: 10.1371/journal.pone.0205839. eCollection 2018.

Improving counterfactual reasoning with kernelised dynamic mixing models

Sonali Parbhoo¹, Omer Gottesman², Andrew Slavin Ross², Matthieu Komorowski³, Aldo Faisal³, Isabella Bon⁴, Volker Roth¹, Finale Doshi-Velez²

Affiliations

¹ Department of Mathematics and Informatics, University of Basel, Basel, Switzerland.
² School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachussets, United States of America.
³ Department of Bioengineering, Imperial College, London, United Kingdom.
⁴ Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy.

PMID: 30419029
PMCID: PMC6231902
DOI: 10.1371/journal.pone.0205839

Improving counterfactual reasoning with kernelised dynamic mixing models

Sonali Parbhoo et al. PLoS One. 2018.

. 2018 Nov 12;13(11):e0205839.

doi: 10.1371/journal.pone.0205839. eCollection 2018.

Authors

Sonali Parbhoo¹, Omer Gottesman², Andrew Slavin Ross², Matthieu Komorowski³, Aldo Faisal³, Isabella Bon⁴, Volker Roth¹, Finale Doshi-Velez²

Affiliations

¹ Department of Mathematics and Informatics, University of Basel, Basel, Switzerland.
² School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachussets, United States of America.
³ Department of Bioengineering, Imperial College, London, United Kingdom.
⁴ Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy.

PMID: 30419029
PMCID: PMC6231902
DOI: 10.1371/journal.pone.0205839

Abstract

Simulation-based approaches to disease progression allow us to make counterfactual predictions about the effects of an untried series of treatment choices. However, building accurate simulators of disease progression is challenging, limiting the utility of these approaches for real world treatment planning. In this work, we present a novel simulation-based reinforcement learning approach that mixes between models and kernel-based approaches to make its forward predictions. On two real world tasks, managing sepsis and treating HIV, we demonstrate that our approach both learns state-of-the-art treatment policies and can make accurate forward predictions about the effects of treatments on unseen patients.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1**
In our model-mixing approach (a), we create a simulator that chooses between parametric (discrete POMDP) and non-parametric (kernel) approaches for performing the forward simulation and use this simulator for planning. In contrast, earlier work (b) solved for a policy using either the POMDP or the kernel, and then chose between policies at test time. Given that both the POMDP and the kernel both have their respective weaknesses, we expect policies derived from just one to be less optimal than those derived from a model that can dynamically mix between both.

**Fig 2. Illustration of dynamics for the toy example.**
The optimal sequence of actions for a type A variant is to initially take action 1 or 2, followed by action 1. For type B variants, the optimal sequence of actions is to first take actions 1 or 2, followed by action 2.

**Fig 3. Simulating the viral load in an HIV patient when the viral load is below detection limits (indicated by 0).**
KDM can detect the occurrence of blips at 12 and 30 months, unlike a MoE. No treatment change should be administered here.

**Fig 4. Comparison of predictive log-likelihood across baselines for HIV for a typical test patient.**
KDM’s predictions are more accurate across the forward time steps.

**Fig 5. Box plot of viral load predictions across 3000 test patients under baselines over a 30-month horizon.**
KDM’s predictions are closer to the ground truth than POMDP or kernel predictions.

**Fig 6. Simulating the SpO₂ of a sepsis test patient under baselines over a 20-hour horizon.**
Counterfactual predictions of SpO₂ levels are more accurate using KDM than existing baselines.

**Fig 7. Comparison of predictive log-likelihood across baselines for sepsis for a typical test patient.**
KDM’s predictions are more accurate across the forward time steps.

**Fig 8. Box plot of SpO₂ predictions across 3000 test patients under baselines over a 20-hour horizon.**
KDM’s predictions are closer to the ground truth than POMDP or kernel predictions.

**Fig 9**
Distributions of frequencies of non-zero IS weights for (a) HIV and (b) sepsis respectively. Our treatments are fairly consistent with those in the data sets.

See this image and copyright information in PMC

References

1. Miotto R, Li L, Kidd BA, Dudley JT. Deep patient: An unsupervised representation to predict the future of patients from the electronic health records. Scientific Reports. 2016;6(26094). - PMC - PubMed
1. Choi E, Bahadori MT, Schuetz A, Stewart WF, Sun J. Doctor AI: Predicting clinical events via recurrent neural networks. In: Machine Learning for Healthcare Conference; 2016. - PMC - PubMed
1. Che Z, Kale D, Li W, Bahadori MT, Liu Y. Deep computational phenotyping. In: KDD; 2015.
1. Bogojeska J, Stöckel D, Zazzi M, Kaiser R, Incardona F, Rosen-Zvi M, et al. History-alignment models for bias-aware prediction of virological response to HIV combination therapy. In: AISTATS; 2012. p. 118–126.
1. Rabinowitz M, et al. Accurate prediction of HIV-1 drug response from the reverse transcriptase and protease amino acid sequences using sparse models created by convex optimization. Bioinformatics. 2005;22(5):541–549. 10.1093/bioinformatics/btk011 - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Improving counterfactual reasoning with kernelised dynamic mixing models

Affiliations

Improving counterfactual reasoning with kernelised dynamic mixing models

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical