Learning Personalized Treatment Strategies with Predictive and Prognostic Covariates in Adaptive Clinical Trials

Published Online:https://doi.org/10.1287/mnsc.2022.02048

References

  • Alban A, Chick SE, Zoumpoulis SI (2021) Expected value of information methods for contextual ranking and selection: Clinical trials and simulation optimization. Kim S, Feng B, Smith K, Masoud S, Zheng Z, Szabo C, Loper M, eds. Proc. Winter Simulation Conf. (IEEE, Piscataway, NJ), 1–12.CrossrefGoogle Scholar
  • Alban A, Chick SE, Zoumpoulis SI (2024) Estimating value of information arm allocation indices in contextual ranking and selection problems. Lam H, Azar E, Batur D, Gao S, Xie W, Hunter S, Rossetti M, eds. Proc. Winter Simulation Conf. (IEEE, Piscataway, NJ), 3602–3613.CrossrefGoogle Scholar
  • Anderer A, Bastani H, Silberholz J (2022) Adaptive clinical trial designs with surrogates: When should we bother? Management Sci. 68(3):1982–2002.LinkGoogle Scholar
  • Astudillo R, Jiang DR, Balandat M, Frazier PI, Bakshy E (2021) Multi-step budgeted Bayesian optimization with unknown evaluation costs. Ranzato M, Beygelzimer A, Dauphin Y, Liang PS, Wortman V, eds. Adv. Neural Inform. Processing Systems (Curran Associates, Red Hook, NY), 34.Google Scholar
  • Auer P (2002) Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. 3(November):397–422. Google Scholar
  • Ayer T, Chen Q (2018) Personalized medicine. Dai T, ed. Handbook of Healthcare Analytics (Wiley & Sons, New York), 109–135.CrossrefGoogle Scholar
  • Bai ZD, Hu F, Rosenberger WF (2002) Asymptotic properties of adaptive designs for clinical trials with delayed response. Ann. Statist. 30(1):122–129.CrossrefGoogle Scholar
  • Bastani H, Bayati M (2020) Online decision making with high-dimensional covariates. Oper. Res. 68(1):276–294.LinkGoogle Scholar
  • Bastani H, Bayati M, Khosravi K (2021) Mostly exploration-free algorithms for contextual bandits. Management Sci. 67(3):1329–1349.LinkGoogle Scholar
  • Berry DA (2011) Adaptive clinical trials in oncology. National Rev. Clinical Oncology 9(4):199–207.CrossrefGoogle Scholar
  • Bertsekas DP, Shreve SE (1978) Stochastic Optimal Control: The Discrete Time Case (Athena Scientific, Belmont, MA).Google Scholar
  • Bertsimas D, Korolko N, Weinstein AM (2019a) Covariate-adaptive optimization in online clinical trials. Oper. Res. 67(4):1150–1161.AbstractGoogle Scholar
  • Bertsimas D, Korolko N, Weinstein AM (2019b) Identifying exceptional responders in randomized trials: An optimization approach. INFORMS J. Optim. 1(3):187–199.LinkGoogle Scholar
  • Bhat N, Farias VF, Moallemi CC, Sinha D (2020) Near-optimal A-B testing. Management Sci. 66(10):4477–4495.LinkGoogle Scholar
  • Branke J, Chick S, Schmidt C (2007) Selecting a selection procedure. Management Sci. 53(12):1916–1932.LinkGoogle Scholar
  • Cakmak S, Zhou E, Gao S (2021) Contextual ranking and selection with Gaussian processes. Kim S, Feng B, Smith K, Masoud S, Zheng Z, Szabo C, Loper M, eds. Proc. Winter Simulation Conf. (IEEE, Piscataway, NJ), 1–12.CrossrefGoogle Scholar
  • Carranza AG, Krishnamurthy SK, Athey S (2023) Flexible and efficient contextual bandits with heterogeneous treatment effect oracle. Proc. 26th Internat. Conf. Artificial Intelligence Statist. (PMLR, New York), 7190–7212.Google Scholar
  • Chen X, Ankenman BE, Nelson BL (2013) Enhancing stochastic kriging metamodels with gradient estimators. Oper. Res. 61(2):512–528.LinkGoogle Scholar
  • Chen X, Shi P, Pu S (2022) Data-pooling reinforcement learning for personalized healthcare intervention. Preprint, submitted November 16, https://arxiv.org/abs/2211.08998.Google Scholar
  • Cheng Y, Berry DA (2007) Optimal adaptive randomized designs for clinical trials. Biometrika 94(3):673–689.CrossrefGoogle Scholar
  • Chick SE, Inoue K (2001) New two-stage and sequential procedures for selecting the best simulated system. Oper. Res. 49(5):732–743.LinkGoogle Scholar
  • Chick SE, Gans N, Yapar O (2022) Bayesian sequential learning for clinical trials of multiple correlated medical interventions. Management Sci. 68(7):4919–4938.LinkGoogle Scholar
  • Cutter GR, Liu Y (2012) Personalized medicine: The return of the house call? Neurology Clinical Practice 2(4):343–351.CrossrefGoogle Scholar
  • Ding L, Hong LJ, Shen H, Zhang X (2021) Technical note: Knowledge gradient for selection with covariates: Consistency and computation. Naval Res. Logist. 69(3):496–507.CrossrefGoogle Scholar
  • Foster JC, Taylor JM, Ruberg SJ (2011) Subgroup identification from randomized clinical trial data. Statist. Medicine 30(24):2867–2880.CrossrefGoogle Scholar
  • Frazier PI, Powell WB, Dayanik S (2008) A knowledge-gradient policy for sequential information collection. SIAM J. Control Optim. 47(5):2410–2439.CrossrefGoogle Scholar
  • Frazier PI, Powell WB, Dayanik S (2009) The knowledge-gradient policy for correlated normal beliefs. INFORMS J. Comput. 21(4):599–613.LinkGoogle Scholar
  • Gao S, Du J, Chen CH (2019) Selecting the optimal system design under covariates. Proc. IEEE 15th Internat. Conf. Automation Sci. Engrg. (IEEE, Piscataway, NJ), 547–552.Google Scholar
  • Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB (2013) Bayesian Data Analysis (CRC Press, Boca Raton, FL).CrossrefGoogle Scholar
  • Goldenshluger A, Zeevi A (2013) A linear response bandit problem. Stochastic Systems 3(1):230–261.LinkGoogle Scholar
  • Hu F, Rosenberger WF (2006) The Theory of Response-Adaptive Randomization in Clinical Trials (John Wiley & Sons, Hoboken, NJ).CrossrefGoogle Scholar
  • Hu F, Zhang L (2004) Asymptotic normality of urn models for clinical trials with delayed response. Bernoulli 10:447–463.CrossrefGoogle Scholar
  • Hu J, Zhu H, Hu F (2015) A unified family of covariate-adjusted response-adaptive designs based on efficiency and ethics. J. Amer. Statist. Assoc. 110:357–367.CrossrefGoogle Scholar
  • Jacko P (2018) Mitigating the curse of dimensionality of the Bayesian Beta-Bernoulli bandit problem. Proc. Workshop Multi-Armed Bandits Learn. Algorithms.Google Scholar
  • Joulani P, Gyorgy A, Szepasvari C (2013) Online learning under delayed feedback. Proc. Internat Conf. Machine Learn. (PMLR, New York), 1453–1461.Google Scholar
  • Kim SH, Nelson BL (2006) Selecting the best system. Henderson S, Nelson B, eds. Handbooks in Operations Research and Management Science (Elsevier, Burlington, NJ).Google Scholar
  • Lai TL, Liao OYW, Kim DW (2013) Group sequential designs for developing and testing biomarker-guided personalized therapies in comparative effectiveness research. Contemporary Clinical Trials 36:651–663.CrossrefGoogle Scholar
  • Lambden S, Laterre PF, Levy MM, Francois B (2019) The SOFA score—Development, utility and challenges of accurate assessment in clinical trials. Critical Care (Fullerton) 23(374):1–9.Google Scholar
  • Li H, Lam H, Liang Z, Peng Y (2022) Efficient learning for clustering and optimizing context-dependent designs. Oper. Res. 72(2):617–638.LinkGoogle Scholar
  • Lipkovich I, Dmitrienko A, D’Agostino RB Sr (2017) Tutorial in biostatistics: Data-driven subgroup identification and analysis in clinical trials. Statist. Medicine 36(1):136–196.CrossrefGoogle Scholar
  • Ma W, Li P, Zhang LX, Hu F (2024) A new and unified family of covariate adaptive randomization procedures and their properties. J. Amer. Statist. Assoc. 119(545):151–162.CrossrefGoogle Scholar
  • Negoescu D, Frazier P, Powell W (2011) The knowledge gradient algorithm for sequencing experiments in drug discovery. INFORMS J. Comput. 23(3):331–492.LinkGoogle Scholar
  • NIH (2022) Personalized medicine. Accessed December 7, 2024, https://www.nih.gov/about-nih/what-we-do/nih-turning-discovery-into-health/personalized-medicine.Google Scholar
  • O’Hagan A, Buck CE, Daneshkhah A, Eiser JR, Garthwaite PH, Jenkinson DJ, Oakley JE, et al. (2006) Uncertain Judgements: Eliciting Experts’ Probabilities (John Wiley & Sons, London).CrossrefGoogle Scholar
  • Oldenhuis C, Oosting S, Gietema J, De Vries E (2008) Prognostic versus predictive value of biomarkers in oncology. Eur. J. Cancer(1965) 44(7):946–953.CrossrefGoogle Scholar
  • Opal S, Dellinger R, Vincent J, et al. (2014) The next generation of sepsis clinical trial designs: What is next after the demise of recombinant human activated protein C? Critical Care Medicine 2014:42.Google Scholar
  • Pallmann P, Bedding AW, Choodari-Oskooei B, Dimairo M, Flight L, Hampson LV, Holmes J, et al. (2018) Adaptive designs in clinical trials: Why use them, and how to run and report them. BMC Med. 16(29):1–15.Google Scholar
  • Pearce M, Branke J (2018) Continuous multi-task Bayesian optimisation with correlation. Eur. J. Oper. Res. 270(3):1074–1085.CrossrefGoogle Scholar
  • Piantadosi S (1997) Clinical Trials: A Methodologic Perspective (Wiley, Hoboken, NJ).Google Scholar
  • Pocock SJ, Simon R (1975) Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial. Biometrics 31:103–115.CrossrefGoogle Scholar
  • Powell WB, Ryzhov IO (2012) Optimal Learning (John Wiley & Sons, Hoboken, NJ).CrossrefGoogle Scholar
  • Qin C, Russo D (2024) Optimizing adaptive experiments: A unified approach to regret minimization and best-arm identification. Preprint, submitted February 16, https://arxiv.org/abs/2402.10592.Google Scholar
  • Rello J, van Engelen TSR, Alp E, Calandra T, Cattoir V, Kern WV, Netea MG, et al. (2018) Towards precision medicine in sepsis: A position paper from the European society of clinical microbiology and infectious diseases. Clinical Microbiology Infection 24(12):1264–1272.CrossrefGoogle Scholar
  • Rigollet P, Zeevi A (2010) Nonparametric bandits with covariates. Kalai AT, Mohri M, eds. Proc. Conf. Learn. Theory (Omnipress, Madison, WI), 54–66.Google Scholar
  • Rojas-Cordova A, Bish EK (2018) Optimal patient enrollment in sequential adaptive clinical trials with binary response. Preprint, submitted August 28, https://doi.org/10.2139/ssrn.3234590.Google Scholar
  • Russo D (2020) Simple Bayesian algorithms for best arm identification. Oper. Res. 68(6):1625–1931.LinkGoogle Scholar
  • Russo D, Van Roy B (2014) Learning to optimize via posterior sampling. Math. Oper. Res. 68(4):1221–1243.LinkGoogle Scholar
  • Ryzhov IO, Powell WB (2011) Information collection on a graph. Oper. Res. 59(1):188–201.LinkGoogle Scholar
  • Ryzhov IO, Powell WB, Frazier PI (2012) The knowledge gradient algorithm for a general class of online learning problems. Oper. Res. 60(1):180–195.LinkGoogle Scholar
  • Schork NJ (2018) Randomized clinical trials and personalized medicine: A commentary on Deaton and Cartwright. Soc. Sci. Medicine 210:71–73.CrossrefGoogle Scholar
  • Scicluna BP, et al. (2017) Classification of patients with sepsis according to blood genomic endotype: A prospective cohort study. Lancet Respiratory Medicine 5(10):816–826. CrossrefGoogle Scholar
  • Sechidis K, Papangelou K, Metcalfe PD, Svensson D, Weatherall J, Brown G (2018) Distinguishing prognostic and predictive biomarkers: An information theoretic approach. Bioinformatics 34(19):3365–3376.CrossrefGoogle Scholar
  • Seymour CW, Kennedy JN, Wang S, Chang CCH, Elliott CF, XU Z, Berry S, et al. (2019) Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA 321(20):2003–2017. CrossrefGoogle Scholar
  • Shen H, Hong LJ, Zhang X (2021) Ranking and selection with covariates for personalized decision making. INFORMS J. Comput. 33(3):1500–1519.Google Scholar
  • Singer M, Deutschman CS, Seymour CW, Shankar-Hari M, Annane D, Bauer M, Bellomo R, et al. (2016) The third international consensus definitions for sepsis and septic shock (sepsis-3). JAMA 315(8):801–810.CrossrefGoogle Scholar
  • Symmans WF, Yau C, Chen YY, Datnow B, Wei S, Feldman MD, Ritter J, et al. (2018) Residual cancer burden (RCB) as prognostic in the I-SPY 2 trial. J. Clinical Oncology 36(15_suppl):520–520. CrossrefGoogle Scholar
  • Tada H, Fujino N, Nomura A, Nakanishi C, Hayashi K, Takamura M, Kawashiri M (2021) Personalized medicine for cardiovascular diseases. J. Human Genetics 66(1):67–74.CrossrefGoogle Scholar
  • Thompson WR (1933) On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3/4):285–294.CrossrefGoogle Scholar
  • Tibshirani R (1996) Regression shrinkage and selection via the lasso. J. Roy. Statist. Soc. Ser. B (Methodological) 58(1):267–288.CrossrefGoogle Scholar
  • Tsimberidou AM, Fountzilas E, Nikanjam M, Kurzrock R (2020) Review of precision cancer medicine: Evolution of the treatment paradigm. Cancer Treatment Rev. 86:102019.CrossrefGoogle Scholar
  • Tunc S, Alagoz O, Burnside E (2014) Opportunities for operations research in medical decision making. IEEE Intelligent Systems 29(3):59.Google Scholar
  • U.S. FDA (2023) Adjusting for covariates in randomized clinical trials for drugs and biological products. Accessed December 7, 2024, https://www.fda.gov/media/148910/download.Google Scholar
  • van Mourik N, Alban A, Uhel F, Vlaar APJ, Zoumpoulis SI, Horn J, Schultz MJ, et al. (2022) Blood transcriptomic endotypes and the response to treatment modalities in sepsis: A prospective cohort study. ESICM LIVES 2022: Part 2, Intensive Care Medicine Experiment. 10(Suppl 2):40.Google Scholar
  • Villar SS, Rosenberger WF (2018) Covariate-adjusted response-adaptive randomization for multi-arm clinical trials using a modified forward looking Gittins index rule. Biometrics 74(1):49–57.CrossrefGoogle Scholar
  • Wang Y, Powell W (2016) An optimal learning method for developing personalized treatment regimes. Preprint, submitted July 6, https://arxiv.org/abs/1607.01462.Google Scholar
  • Wang H, Yee D (2019) I-SPY 2: A neoadjuvant adaptive clinical trial designed to improve outcomes in high-risk breast cancer. Current Breast Cancer Rep. 11(4):303–310.CrossrefGoogle Scholar
  • Wang Y, Wang C, Powell W (2016) The knowledge gradient for sequential decision making with stochastic binary feedbacks. Proc. 33rd Internat. Conf. Machine Learn., vol. 48 (PMLR, New York), 1138–1147.Google Scholar
  • Wang J, Clark SC, Liu E, Frazier PI (2020) Parallel Bayesian global optimization of expensive functions. Oper. Res. 68(6):1850–1865.LinkGoogle Scholar
  • Williamson SF, Jacko P, Villar SS, Jaki T (2017) A Bayesian adaptive design for clinical trials in rare diseases. Comput. Statist. Data Anal. (Oxford) 113:136–153.CrossrefGoogle Scholar
  • Wu J, Frazier P (2016) The parallel knowledge gradient method for batch Bayesian optimization. Lee D, Sugiyama M, Luxburg U, Guyon I, Garnett R, eds. Adv. Neural Inform. Processing Systems (Curran Associates, Red Hook, NY), 29.Google Scholar
  • Wu AD, Zumbo BD (2008) Understanding and using mediators and moderators. Soc. Indicators Res. 87(3):367–392.CrossrefGoogle Scholar
  • Xie J, Frazier PI, Chick SE (2016) Bayesian optimization via simulation with pairwise sampling and correlated prior beliefs. Oper. Res. 64(2):542–559.LinkGoogle Scholar
  • Xiong S (2020) Personalized optimization and its implementation in computer experiments. IISE Trans. 52(5):528–536.CrossrefGoogle Scholar
  • Zhang LX, Hu F, Cheung SH, Chan WS (2007) Asymptotic properties of covariate-adjusted response-adaptive designs. Ann. Statist. 35(3):1166–1182.CrossrefGoogle Scholar
  • Zhao W, Ma W, Wang F, Hu F (2022) Incorporating covariates information in adaptive clinical trials for precision medicine. Pharmacy Statist. 21(1):176–195.CrossrefGoogle Scholar
  • Zimmerman JE, Kramer AA, McNair DS, Malila FM (2006) Acute physiology and chronic health evaluation (APACHE) IV: Hospital mortality assessment for today’s critically ill patients. Critical Care Medicine 34(5):1297–1310.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.