TY - JOUR
T1 - Optimal dose escalation methods using deep reinforcement learning in phase I oncology trials
AU - Matsuura, Kentaro
AU - Sakamaki, Kentaro
AU - Honda, Junya
AU - Sozu, Takashi
N1 - Publisher Copyright:
© 2023 Taylor & Francis Group, LLC.
PY - 2023
Y1 - 2023
N2 - In phase I trials of a novel anticancer drug, one of the most important objectives is to identify the maximum tolerated dose (MTD). To this end, a number of methods have been proposed and evaluated under various scenarios. However, the percentages of correct selection (PCS) of MTDs using previous methods are insufficient to determine the dose for late-phase trials. The purpose of this study is to construct an action rule for escalating or de-escalating the dose and continuing or stopping the trial to increase the PCS as much as possible. We show that deep reinforcement learning with an appropriately defined state, action, and reward can be used to construct such an action selection rule. The simulation study shows that the proposed method can improve the PCS compared with the 3 + 3 design, CRM, BLRM, BOIN, mTPI, and i3 + 3 methods.
AB - In phase I trials of a novel anticancer drug, one of the most important objectives is to identify the maximum tolerated dose (MTD). To this end, a number of methods have been proposed and evaluated under various scenarios. However, the percentages of correct selection (PCS) of MTDs using previous methods are insufficient to determine the dose for late-phase trials. The purpose of this study is to construct an action rule for escalating or de-escalating the dose and continuing or stopping the trial to increase the PCS as much as possible. We show that deep reinforcement learning with an appropriately defined state, action, and reward can be used to construct such an action selection rule. The simulation study shows that the proposed method can improve the PCS compared with the 3 + 3 design, CRM, BLRM, BOIN, mTPI, and i3 + 3 methods.
KW - Adaptive design
KW - Clinical trial
KW - Dose-finding
KW - Maximum tolerated dose
KW - Optimal design
UR - http://www.scopus.com/inward/record.url?scp=85147443767&partnerID=8YFLogxK
U2 - 10.1080/10543406.2023.2170402
DO - 10.1080/10543406.2023.2170402
M3 - Article
C2 - 36717962
AN - SCOPUS:85147443767
SN - 1054-3406
VL - 33
SP - 639
EP - 652
JO - Journal of biopharmaceutical statistics
JF - Journal of biopharmaceutical statistics
IS - 5
ER -