Robust Control of Partially Observable Failing Systems

Published Online:https://doi.org/10.1287/opre.2016.1495

References

  • Ben-Tal A, Nemirovski A (1998) Robust convex optmization. Math. Oper. Res. 23(4):769–805.LinkGoogle Scholar
  • Ben-Tal A, Nemirovski A (1999) Robust solutions to uncertain programs. Oper. Res. Lett. 25:1–13.CrossrefGoogle Scholar
  • Ben-Tal A, Nemirovski A (2000) Robust solutions of linear programming problems contaminated with uncertain data. Math. Program. 88: 411–424.CrossrefGoogle Scholar
  • Bertsekas DP, Shreve SE (1978) Stochastic Optimal Control: The Discrete Time Case (Academic Press, NY).Google Scholar
  • Bertsimas D, Sim M (2004) The price of robustness. Oper. Res. 52(1): 35–53.LinkGoogle Scholar
  • Boyd S, Vandenberghe L (2004) Convex Optimization (Cambridge University Press, Cambridge, UK).CrossrefGoogle Scholar
  • Brown DB, Smith JE, Sun P (2010) Information relaxations and duality in stochastic dynamic programs. Oper. Res. 58(4):785–801.LinkGoogle Scholar
  • Caro F, Gupta AD (2015) Robust control of the multi-armed bandit problem. Ann. Oper. Res., ePub ahead of print August 21, 2015, http://dx.doi.org/10.1007/s10479-015-1965-7.CrossrefGoogle Scholar
  • Çavuş Ö, Ruszczyński A (2014a) Computational methods for risk-averse undiscounted transient Markov models. Oper. Res. 62(2):401–417.LinkGoogle Scholar
  • Çavuş Ö, Ruszczyński A (2014b) Risk-averse control of undiscounted transient Markov models. SIAM J. Control Optim. 52:3935–3966.CrossrefGoogle Scholar
  • Chan CW, Farias VF (2009) Stochastic depletion problems: Effective myopic policies for a class of dynamic optimization problems. Math. Oper. Res. 34(2):333–350.LinkGoogle Scholar
  • Dai Pra P, Meneghini L, Runggaldier WJ (1996) Connections between stochastic control and dynamic games. Math. Control Signal 9:303–326.CrossrefGoogle Scholar
  • Dayanik S, Gurler U (2002) An adaptive Bayesian replacement policy with minimal repair. Oper. Res. 50(3):552–558.LinkGoogle Scholar
  • Dayanik S, Goulding C, Poor VH (2008) Bayesian sequential change diagnosis. Math. Oper. Res. 33(2):475–496.LinkGoogle Scholar
  • Dupuis P, Ellis RS (1997) A Weak Convergence Approach to the Theory of Large Deviations (John Wiley & Sons, NY).CrossrefGoogle Scholar
  • El Ghaoui L, Lebret H (1997) Robust solutions to least-square problems to uncertain data matrices. SIAM J. Matrix Anal. Appl. 18:1035–1064.CrossrefGoogle Scholar
  • Elwany AH, Gebraeel NZ, Maillart LM (2011) Structured replacement policies for components with complex degradation processes and dedicated sensors. Oper. Res. 59(3):684–695.LinkGoogle Scholar
  • González-Trejo JI, Hernández-Lerma O, Hoyos-Reyes LF (2003) Minimax control of discrete-time stochastic systems. SIAM J. Control Optim. 41:1626–1659.CrossrefGoogle Scholar
  • Gotoh J, Kim MJ, Lim AEB (2016) Robust empirical optimization is almost the same as mean variance optimization. Working paper, Chuo University, Tokyo.Google Scholar
  • Greenberg HJ, Pierskalla WP (1971) A review of quasiconvex functions. Oper. Res. 19(1):1553–1570.LinkGoogle Scholar
  • Hansen LP, Sargent TJ (2007) Robust estimation and control without commitment. J. Econom. Theory 136:1–27.CrossrefGoogle Scholar
  • Iyengar GN (2005) Robust dynamic programming. Math. Oper. Res. 30(2):257–280.LinkGoogle Scholar
  • Jain A, Lim AEB, Shanthikumar JG (2010) On the optimality of threshold control in queues with model uncertainty. Queueing Sy. 65:157–174.CrossrefGoogle Scholar
  • Kim MJ, Lim AEB (2015) Robust multi-armed bandit problems. Management Sci. 62:264–285.Google Scholar
  • Kim MJ, Makis V (2012) Optimal control of a partially observable failing system with costly multivariate observations. Stoch. Model. 28: 584–608.CrossrefGoogle Scholar
  • Kim MJ, Makis V (2013) Joint optimization of sampling and control of partially observable failing systems. Oper. Res. 61(3):777–790.LinkGoogle Scholar
  • Kurt M, Kharoufeh JP (2010) Monotone optimal replacement policies for a Markovian deteriorating system in a controllable environment. Oper. Res. Lett. 38:273–279.CrossrefGoogle Scholar
  • Li JY, Kwon RH (2013) Portfolio selection under model uncertainty: A penalized moment-based optimization approach. J. Global Optim. 56:131–164.CrossrefGoogle Scholar
  • Lim AEB, Shanthikumar JG (2007) Relative entropy, exponential utility, and robust dynamic pricing. Oper. Res. 55(2):198–214.LinkGoogle Scholar
  • Maillart LM (2006) Maintenance policies for systems with condition monitoring and obvious failures. IIE Trans. 38:463–475.CrossrefGoogle Scholar
  • Makis V (2008) Multivariate Bayesian control chart. Oper. Res. 56(2):487–496.LinkGoogle Scholar
  • Makis V, Jardine AKS (1992) Optimal replacement in the proportional hazards model. INFOR 30:172–183.Google Scholar
  • Mannor S, Mebel O, Xu H (2012) Lightning does not strike twice: Robust MDPs with coupled uncertainty. Langford J, Pineau J, eds. Proc. 29th Internat. Conf. Machine Learn. (ACM, New York), 385–392.Google Scholar
  • Munkres JR (2000) Topology (Prentice Hall, Upper Saddle River, NJ).Google Scholar
  • Nilim A, El Ghaoui L (2005) Robust control of Markov decision process with uncertain transition matrix. Oper. Res. 53(5):780–798.LinkGoogle Scholar
  • Papadimitriou CH, Tsitsiklis JN (1987) The complexity of Markov decision processes. Math. Oper. Res. 12(3):441–450.LinkGoogle Scholar
  • Peterson IR, James MR, Dupuis P (2000) Minimax optimal control of stochastic uncertain systems with relative entropy constraints. IEEE T. Automat. Contr. 45:398–412.CrossrefGoogle Scholar
  • Rogers LCG (2007) Pathwise stochastic optimal control. SIAM J. Control Optim. 46:1116–1132.CrossrefGoogle Scholar
  • Ross SM (1971) Quality control under Markovian deterioration. Management Sci. 17:587–596.LinkGoogle Scholar
  • Ruszczyński A (2010) Risk-averse dynamic programming for Markov decision processes. Math. Program., Ser. B 125:235–261.CrossrefGoogle Scholar
  • Ruszczyński A, Shapiro A (2006) Conditional risk mappings. Math. Oper. Res. 31(3):544–561.LinkGoogle Scholar
  • Stachowiak GW, Batchelor AW, Stachowiak GB (2004) Experimental Methods in Tribology (Elsevier, Amsterdam).Google Scholar
  • Ulukus MY, Kharoufeh JP, Maillart LM (2012) Optimal replacement policies under environment-driven degradation. Probab. Eng. Inform. Sci. 26:405–424.CrossrefGoogle Scholar
  • Wiesemann W, Kuhn D, Rustem B (2013) Robust Markov decision processes. Math. Oper. Res. 38(1):153–183.LinkGoogle Scholar
  • White CC-III (1978) Optimal inspection and repair of a production process subject to deterioration. J. Oper. Res. Soc. 29:235–243.CrossrefGoogle Scholar
  • Xu H, Mannor S (2012) Distributionally robust Markov decision processes. Math. Oper. Res. 37(2):288–300.LinkGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.