Biomedical research may uncover insights regarding the interaction of the treatments of a disease with patient covariates. We show how to use such insights to improve the efficiency of adaptive clinical trials for precision medicine by extending ideas from optimal Bayesian learning. We present a model for response-adaptive multi-arm clinical trials that leverages knowledge about the predictive-prognostic covariate structure to accelerate the learning of personalized treatment strategies that obtain the best expected outcomes for post-trial patients. Our base model is a contextual linear bandit for best-arm identification, and outcomes may be observed with delay. We characterize the optimal policy for sequentially allocating treatments to in-trial patients and, because it is hard to compute, propose several computable heuristics based on Bayesian one-step look-ahead techniques. We prove that several of our proposed heuristics are asymptotically optimal in learning treatment strategies. Numerical results based on two case studies motivated by sepsis management show that our heuristics can significantly improve clinical trial efficiency to learn a treatment strategy for precision medicine. We provide extensions that allow for rewards from outcomes of in-trial patients (resolving the exploration-exploitation tradeoff) and for inferring covariate structure using Lasso when biomedical insights on covariate structure are lacking. Our proposed trial design is of interest to funders, designers, and managers of clinical trials. It may also apply to other contextual bandit problems in settings where insights about covariate-treatment interactions are available.

This paper was accepted by Stefan Scholtes, healthcare management.

Funding: This work was supported by the Marie Skłodowska-Curie Actions under the European Union’s Horizon 2020 research and innovation programme [Grant 676129]. S. E. Chick gratefully acknowledges the support of Dr Simba Gill and Sabi Dau to the INSEAD Healthcare Management Initiative.

Supplemental Material: The online appendix and data files are available at https://doi.org/10.1287/mnsc.2022.02048.

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:July 08, 2022
Accepted:March 27, 2025
Published Online:October 29, 2025

Cite as

Andres Alban, Stephen E. Chick, Spyros I. Zoumpoulis (2025) Learning Personalized Treatment Strategies with Predictive and Prognostic Covariates in Adaptive Clinical Trials. Management Science 0(0).

https://doi.org/10.1287/mnsc.2022.02048

Keywords

Acknowledgments

The authors thank Drs. A. P. J. Vlaar, W. J. Wiersinga, F. Uhel, B. Scicluna, and N. van Mourik (Amsterdam University Medical Center) for discussions. The authors thank presentation attendees at Columbia, Duke Fuqua School of Business, INSEAD, Wharton, and at the following conferences and workshops: Applied Probability Society, MSOM, INFORMS Annual Meeting, the Workshop on Information and Learning in Decisions and Operations, and the Workshop on Technological Innovation in Health Care Delivery at the Institute for Mathematical and Statistical Innovation.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Learning Personalized Treatment Strategies with Predictive and Prognostic Covariates in Adaptive Clinical Trials

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News