Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Sep 3;33(5):639-652.
doi: 10.1080/10543406.2023.2170402. Epub 2023 Jan 30.

Optimal dose escalation methods using deep reinforcement learning in phase I oncology trials

Affiliations

Optimal dose escalation methods using deep reinforcement learning in phase I oncology trials

Kentaro Matsuura et al. J Biopharm Stat. .

Abstract

In phase I trials of a novel anticancer drug, one of the most important objectives is to identify the maximum tolerated dose (MTD). To this end, a number of methods have been proposed and evaluated under various scenarios. However, the percentages of correct selection (PCS) of MTDs using previous methods are insufficient to determine the dose for late-phase trials. The purpose of this study is to construct an action rule for escalating or de-escalating the dose and continuing or stopping the trial to increase the PCS as much as possible. We show that deep reinforcement learning with an appropriately defined state, action, and reward can be used to construct such an action selection rule. The simulation study shows that the proposed method can improve the PCS compared with the 3 + 3 design, CRM, BLRM, BOIN, mTPI, and i3 + 3 methods.

Keywords: Adaptive design; Clinical trial; Dose-finding; Maximum tolerated dose; Optimal design.

PubMed Disclaimer

Publication types

Substances

LinkOut - more resources