Data-Driven Hospital Admission Control: A Learning Approach
References
- (2011) Improved algorithms for linear stochastic bandits. Shawe-Taylor J, Zemel RS, Bartlett PL, Pereira FCN, Weinberger KQ, eds. Adv. Neural Inform. Processing Systems (NIPS), vol. 24 (Curran Associates, Red Hook, NY), 2312–2320.Google Scholar
- (2014) Bandits with concave rewards and convex knapsacks. Proc. 15th ACM Conf. Econom. Comput. (ACM, Palo Alto, CA), 989–1006.Google Scholar
- (2016) Linear contextual bandits with knapsacks. Lee DD, Sugiyama M, Luxburg UV, Guyon I, Garnett R, eds. Adv. Neural Inform. Processing Systems (NIPS), vol. 29 (Curran Associates Inc., Red Hook, NY), 3450–3458.Google Scholar
- (2016) An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives. J. Machine Learn. Res. 49:4–18.Google Scholar
- (2016) Response-adaptive designs for clinical trials: Simultaneous learning from multiple patients. Eur. J. Oper. Res. 248(2):619–633.Crossref, Google Scholar
- (2002) Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3:397–422.Google Scholar
- (2002) Finite-time analysis of the multiarmed bandit problem. Machine Learn. 47(2):235–256.Crossref, Google Scholar
- (2013) Bandits with knapsacks. Proc. 54th Annual Sympos. Foundations Comput. Sci. (IEEE, Piscataway, NJ), 207–216.Google Scholar
- (2014) Resourceful contextual bandits. Proc. 27th Annual Conf. Learn. Theory, vol. 35 (PMLR, New York), 1109–1134.Google Scholar
- (2019) Contextual bandits with cross-learning. Wallach H, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox E, Garnett R, eds. Adv. Processing Systems (NIPS), vol. 32 (Curran Associates, Red Hook, NY), 9676–9685.Google Scholar
- (2020) Online decision-making with high-dimensional covariates. Oper. Res. 68(1):276–294.Google Scholar
- (2021) Mostly exploration-free algorithms for contextual bandits. Management Sci. 67(3):1329–1349.Google Scholar
- (2000) Asymptotic optimality of tracking policies in stochastic networks. Ann. Appl. Probab. 10(4):1065–1083.Crossref, Google Scholar
- (2000) Hospital readmissions as a measure of quality of healthcare: Advantages and limitations. Arch. Internal Medicine 160(8):1074–1081.Crossref, Google Scholar
- (2019) Learning and hierarchies in service systems. Management Sci. 65(3):1268–1285.Link, Google Scholar
- (2021) Dynamic programs with shared resources and signals: Dynamic fluid policies and asymptotic optimality. Oper. Res. 70(5):3015–3033.Link, Google Scholar
- (2018) Assessing the impact of service level when customer needs are uncertain: An empirical investigation of hospital step-down units. Management Sci. 65(2):751–775.Link, Google Scholar
- (2021) Optimal routing under demand surges: The value of future arrival rates. Preprint, submitted December 14, https://dx.doi.org/10.2139/ssrn.3980227.Google Scholar
- (2013) Use of health IT for higher-value critical care. New England J. Medicine 368(7):594–597.Crossref, Google Scholar
- (2012) Intensive care unit admitting patterns in the Veterans Affairs healthcare system. Arch. Internal Medicine 172(16):1220–1226.Crossref, Google Scholar
- (2017) Revenue management of reusable resources with advanced reservations. Production Oper. Management 26(5):836–859.Crossref, Google Scholar
- (2022) Hedging the drift: Learning to optimize under nonstationarity. Management Sci. 68(3):1696–1713.Google Scholar
- (2011) Contextual bandits with linear payoff functions. Proc. 14th Internat. Conf. Artificial Intelligence Statist., vol. 15 (PMLR, New York), 208–214.Google Scholar
- (2013) Mortality predictions on admission as a context for organizing care activities. J. Hospital Medicine 8(5):229–235.Crossref, Google Scholar
- (2019) Inpatient overflow: An approximate dynamic programming approach. Manufacturing Service Oper. Management 21(4):894–911.Link, Google Scholar
- (2008) Stochastic linear optimization under bandit feedback. Working paper, University of Chicago, IL.Google Scholar
- (2019) Off-service placement in inpatient ward network: Resource pooling vs. service slowdown. Columbia Business School Research Paper, New York.Google Scholar
- (2017) A practical method for solving contextual bandit problems using decision trees. Preprint, submitted June 14, https://arxiv.org/abs/1706.04687.Google Scholar
- (2010) Parametric bandits: The generalized linear case. Lafferty JD, Williams CKI, Shawe-Taylor J, Zemel RS, Culotta A, eds. Adv. Neural Inform. Processing Systems, vol. 23 (Curran Associates, Inc., New York), 586–594.Google Scholar
- (2020) Managing flexible products on a network. Preprint, submitted April 27, http://dx.doi.org/10.2139/ssrn.3567371.Google Scholar
- (2013) Data shows reduction in Medicare hospital readmission rates during 2012. Medicare Medicaid Res. Rev. 3(2):E1–E11.Crossref, Google Scholar
- (2007) Critical care medicine use and cost among Medicare beneficiaries 1995–2000: Major discrepancies between two United States federal Medicare databases. Critical Care Medicine 35(3):692–699.Crossref, Google Scholar
- (2014) Design and optimization methods for elective hospital admissions. Oper. Res. 62(6):1265–1282.Link, Google Scholar
- (2016) Reducing hospital readmissions by integrating empirical prediction with resource optimization. Production Oper. Management 25(2):233–257.Crossref, Google Scholar
- (2021) Matching while learning. Oper. Res. 69(2):655–681.Link, Google Scholar
- (2011) How to solve the cost crisis in healthcare. Harvard Bus. Rev. 89(9):46–52.Google Scholar
- (2019) Contextual learning with online convex optimization: Theory and application to chronic diseases. Preprint, submitted December 31, https://dx.doi.org/10.2139/ssrn.3501316.Google Scholar
- (2015) ICU admission control: An empirical study of capacity allocation and its implication for patient outcomes. Management Sci. 61(1):19–38.Link, Google Scholar
- (2016) Regret of queueing bandits. Lee DD, von Luxburg U, Garnett R, Sugiyama M, Guyon I, eds. Adv. Neural Inform. Processing Systems, vol. 30 (Curran Associates, Red Hook, NY), 1669–1677.Google Scholar
- (2021) Learning unknown service rates in queues: A multiarmed bandit approach. Oper. Res. 69(1):315–330.Link, Google Scholar
- (2020) Bandit Algorithms (Cambridge University Press).Crossref, Google Scholar
- (2020) Real-time dynamic pricing for revenue management with reusable resources, advance reservation, and deterministic service time requirements. Oper. Res. 68(3):676–685.Link, Google Scholar
- (2010) Provably near-optimal LP-based policies for revenue management in systems with reusable resources. Oper. Res. 58(2):503–507.Link, Google Scholar
- (2017) Provably optimal algorithms for generalized linear contextual bandits. Proc. 34th Internat. Conf. Machine Learn. vol. 70 (Curran Associates Inc., New York), 2071–2080.Google Scholar
- (2010) A contextual-bandit approach to personalized news article recommendation. Proc. 19th Internat. Conf. World Wide Web (ACM, New York), 661–670.Google Scholar
- (2008) On the choice-based linear programming model for network revenue management. Manufacturing Service Oper. Management 10(2):288–310.Link, Google Scholar
- (2000) Discrete-review policies for scheduling stochastic networks: Trajectory tracking and fluid-scale asymptotic optimality. Ann. Appl. Probab. 10(3):897–929.Crossref, Google Scholar
- (2014) Reduced length of hospital stay in colorectal surgery after implementation of an enhanced recovery protocol. Anesthesia Analgesia 118(5):1052–1061.Crossref, Google Scholar
- (2020) Non-stationary bandits with habituation and recovery dynamics. Oper. Res. 68(5):1493–1516.Google Scholar
- (2017) Dynamic learning of patient response types: An application to treating chronic diseases. Management Sci. 64(8):3469–3488.Link, Google Scholar
- (2004) Modifying unwarranted variations in healthcare: Shared decision making using patient decision aids: A review of the evidence base for shared decision making. Health Affairs 23(Suppl2):VAR–63.Crossref, Google Scholar
- (2018) Price and assortment optimization for reusable resources. Preprint, submitted November 16, 2017, https://dx.doi.org/10.2139/ssrn.3070625.Google Scholar
- (2014) The role of stepdown beds in hospital care. Amer. J. Respiratory Critical Care Medicine 190(11):1210–1216.Crossref, Google Scholar
- (2012) Monitoring costs in the ICU: A search for a pertinent methodology. Acta Anaesthesiologica Scandinavica 56(9):1104–1113.Crossref, Google Scholar
- (2010) Linearly parameterized bandits. Math. Oper. Res. 35(2):395–411.Link, Google Scholar
- (2014) Learning to optimize via posterior sampling. Math. Oper. Res. 39(4):1221–1243.Link, Google Scholar
- (2017) Managing patient admissions in a neurology ward. Oper. Res. 65(3):635–656.Link, Google Scholar
- (2020) Adaptive matching for expert systems with uncertain task types. Oper. Res. 68(5):1403–1424.Link, Google Scholar
- (2003) Optimizing admissions to an intensive care unit. Health Care Management Sci. 6(3):131–136.Crossref, Google Scholar
- (2019) Introduction to multi-armed bandits. Foundations and Trends in Machine Learning, vol. 12 (Now Publishers, Boston).Google Scholar
- (2015) Standardization of care: Impact of an enhanced recovery protocol on length of stay, complications, and direct costs after colorectal surgery. J. Amer. College Surgeons 220(4):430–443.Crossref, Google Scholar
- (2016) The knowledge gradient for sequential decision making with stochastic binary feedbacks. Lawrence N, Reid M, eds. Internat. Conf. Machine Learn. (PMLR, New York), 1138–1147.Google Scholar
- (2019) An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits. Adv. Appl. Probab. 51(3):745–772.Crossref, Google Scholar

