Proc Mach Learn Res. 2023 Aug;216:1047-1057.

Assessing the Impact of Context Inference Error and Partial Observability on RL Methods for Just-In-Time Adaptive Interventions

Karine Karine et al. Proc Mach Learn Res. 2023 Aug.

Abstract

Just-in-Time Adaptive Interventions (JITAIs) are a class of personalized health interventions developed within the behavioral science community. JITAIs aim to provide the right type and amount of support by iteratively selecting a sequence of intervention options from a pre-defined set of components in response to each individual's time-varying state. In this work, we explore the application of reinforcement learning methods to the problem of learning intervention option selection policies. We study the effect of context inference error and partial observability on the ability to learn effective policies. Our results show that the propagation of uncertainty from context inferences is critical to improving intervention efficacy as context uncertainty increases, while policy gradient algorithms can provide remarkable robustness to partially observed behavioral state information.
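The setup described above can be sketched in miniature as follows. This is an illustrative toy example, not the authors' implementation: the Gaussian noise model in infer_context, the linear softmax policy, and the reinforce_update helper are all assumptions chosen to show the two state representations the paper compares (the full context probability vector versus a one-hot encoding of the most likely context).

```python
import numpy as np

rng = np.random.default_rng(0)


def infer_context(true_context, sigma, n_contexts=3):
    # Simulate a noisy context-inference system: perturb a one-hot
    # encoding of the true context with Gaussian noise (scale sigma),
    # then renormalize with a softmax to get context probabilities.
    logits = np.eye(n_contexts)[true_context] + sigma * rng.normal(size=n_contexts)
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()


def state_features(probs, use_probabilities):
    # The two state representations compared in the paper:
    # the full probability vector vs. a one-hot of the argmax context.
    if use_probabilities:
        return probs
    one_hot = np.zeros_like(probs)
    one_hot[np.argmax(probs)] = 1.0
    return one_hot


def policy(theta, features):
    # Linear softmax policy over a small set of intervention options.
    logits = features @ theta
    p = np.exp(logits - logits.max())
    return p / p.sum()


def reinforce_update(theta, features, action, ret, lr=0.1):
    # One REINFORCE step: theta += lr * return * grad log pi(action|state),
    # where grad log pi = features (x) (one_hot(action) - pi) for a
    # linear softmax policy.
    p = policy(theta, features)
    grad = -np.outer(features, p)
    grad[:, action] += features
    return theta + lr * ret * grad
```

Under this sketch, propagating uncertainty amounts to calling state_features with use_probabilities=True, so the policy can hedge its action selection when the context inference is ambiguous.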

Keywords: Reinforcement learning; adaptive interventions; context inference; empirical evaluation; mobile health; partial observability.


Figures

Figure 1: Context inference error as a function of σ.

Figure 2: Effect of learning with the most likely context versus context probabilities for (a) DQN and (b) REINFORCE; effect of partial observability for REINFORCE and DQN when learning with (c) the most likely context and (d) context probabilities.

Figure 3: Distribution of actions selected by REINFORCE when given access to context probabilities (top row) versus only the inferred most likely context (bottom row).

Figure 4: Learning curves of DQN and REINFORCE.

Figure 5: Distribution of actions selected by DQN when given access to context probabilities (top row) versus only the inferred most likely context (bottom row).

Figure 6: Performance as a function of the disengagement increment ϵd and decay parameter δd, for DQN (top row) and REINFORCE (bottom row).
