An Approximation Approach for the Deviation Matrix of Continuous-Time Markov Processes with Application to Markov Decision Theory

Published Online:https://doi.org/10.1287/opre.1090.0786

References

  • Altman E., Feinberg E., Shwartz A. Applications of Markov decision processes in communication networks. Handbook of Markov Decision Processes (2002) (Kluwer, Boston) 489–536CrossrefGoogle Scholar
  • Altman E., Hordijk A. Zero-sum Markov games and worst-case optimal control of queueing systems. Queueing Systems (1995) 21:415–447CrossrefGoogle Scholar
  • Altman E., Gaujal B., Hordijk A.Discrete-Event Control of Stochastic Networks: Multimodularity and Regularity (2003) 1829(Springer, New York) Springer Lecture Notes in MathematicsCrossrefGoogle Scholar
  • Altman E., Hordijk A., Spieksma F. Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards. Math. Oper. Res. (1997) 22:588–618LinkGoogle Scholar
  • Arapostathis A., Borkar V., Fernández-Gaucherand E., Ghosh M., Marcus S. Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM J. Control Optim. (1993) 31:282–344CrossrefGoogle Scholar
  • Asmussen S.Applied Probability and Queues (1987) (Springer, New York) Google Scholar
  • Bather J. Optimal stationary policies for denumerable Markov chains in continuous time. Adv. Appl. Probab. (1976) 8:144–158CrossrefGoogle Scholar
  • Bertsekas D.Dynamic Programming and Optimal Control (2005) 3rd ed.(Athena Scientific, Nashua, NH) Google Scholar
  • Brown L., Gans N., Mandelbaum A., Sakov A., Shen H., Zeltyn S., Zhao L. Statistical analysis of a telephone call center: A queueing-science perspective. J. Amer. Statist. Assoc. (2005) 469:36–50CrossrefGoogle Scholar
  • Cao X.Stochastic Learning and Optimization: A Sensitivity-Based Approach (2007) (Springer, New York) CrossrefGoogle Scholar
  • Chao X., Zhao Y. Analysis of multi-server queues with station and server vacations. Eur. J. Oper. Res. (1997) 110:392–406CrossrefGoogle Scholar
  • Coolen-Schrijner P., van Doorn E. The deviation matrix of a continuous-time Markov chain. Probab. Engrg. Informational Sci. (2002) 16:351–366CrossrefGoogle Scholar
  • Dekker R., Hordijk A. Average, sensitive and Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards. Math. Oper. Res. (1988) 13:395–420LinkGoogle Scholar
  • Dekker R., Hordijk A. Denumerable semi-Markov decision chains with small interest rates. Ann. Oper. Res. (1991) 28:185–212CrossrefGoogle Scholar
  • Dekker R., Hordijk A. Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains. Math. Oper. Res. (1992) 17:271–289LinkGoogle Scholar
  • Dekker R., Hordijk A., Spieksma F. On the relation between recurrence and ergodicity properties in denumerable Markov decision chains. Math. Oper. Res. (1994) 19:539–559LinkGoogle Scholar
  • Dynkin E., Yushkevich A.Controlled Markov Processes (1979) (Springer-Verlag, New York) CrossrefGoogle Scholar
  • Feinberg E., Shwartz A.Handbook of Markov Decision Processes: Methods and Applications (2002) (Kluwer, Boston) CrossrefGoogle Scholar
  • Guo X. Continuous-time Markov decision processes with discounted rewards: The case of Polish spaces. Math. Oper. Res. (2007) 32:73–87LinkGoogle Scholar
  • Guo X., Cao X. Optimal control of ergodic continuous-time Markov chains with average sample-path rewards. SIAM J. Control Optim. (2005) 44:29–48CrossrefGoogle Scholar
  • Guo X., Hernández-Lerma O. Continuous-time controlled Markov chains with discounted rewards. Acta Appl. Math. (2003) 79:195–216CrossrefGoogle Scholar
  • Guo X., Liu K. A note on optimality conditions for continuous-time Markov decision processes with average cost criterion. IEEE Trans. Automatic Control (2001) 46:1984–1989CrossrefGoogle Scholar
  • Guo X., Hernández-Lerma O., Prieto-Rumeau T. A survey of recent results on continuous-time Markov decision processes. Sociedad de Estadistica e Investigación Operativa TOP (2006) 14:177–261Google Scholar
  • Heidergott B., Hordijk A., Leder N. Series expansions for continuous-time Markov processes. Oper. Res. (2009) . ePub ahead of print October 28, http://or.journal.informs.org/cgi/content/abstract/opre.1090.0738v1Google Scholar
  • Heidergott B., Hordijk A., van Uitert M. Series expansions for finite-state Markov chains. Probab. Engrg. Informational Sci. (2007) 21:381–400CrossrefGoogle Scholar
  • Hordijk A., Spieksma F. Constrained admission control to a queueing system. Adv. Appl. Probab. (1989) 21:409–431CrossrefGoogle Scholar
  • Hordijk A., van der Duyn Schouten F. Average optimal policies in Markov decision drift processes with applications to a queueing and a replacement model. Adv. Appl. Probab. (1983) 15:274–303CrossrefGoogle Scholar
  • Hordijk A., Yushkevich A. Blackwell optimality in the class of all policies in Markov decision chains with a Borel state and unbounded rewards. Math. Methods Oper. Res. (1999) 50:421–448CrossrefGoogle Scholar
  • Hordijk A., Yushkevich A., Feinberg E., Shwartz A. Blackwell optimality. Handbook of Markov Decision Processes (2002) (Kluwer, Boston) 231–267CrossrefGoogle Scholar
  • Kakumanu P. Continuous time Markov decision processes with average return criterion. J. Math. Anal. Appl. (1975) 52:173–188CrossrefGoogle Scholar
  • Kijima M.Markov Processes for Stochastic Modeling (1997) (Chapman & Hall, London) CrossrefGoogle Scholar
  • Koole G. The deviation matrix of the M/M/1/∞ and M/M/1/N queue, with application to controlled queueing models. Proc. 37th IEEE CDC (1998) (IEEE Press, Tampa, FL) 56–59CrossrefGoogle Scholar
  • Neuts M.Matrix-Geometric Solutions in Stochastic Models—An Algorithmic Approach (1994) (Constable and Company, London) Google Scholar
  • Olsson M. The EMpht-programme. Manual (1998) (Gothenburg University, Gothenburg, Sweden) Google Scholar
  • Puterman M.Markov Decision Processes (1994) (John Wiley & Sons, New York) CrossrefGoogle Scholar
  • Riska A., Smirni E. M/G/1-type Markov processes: A tutorial. Performance Evaluation of Complex Systems: Techniques and Tools (2002) 2459:315–325Google Scholar
  • Ross S.Applied Probability Models with Optimization Applications (1970) (Holden-Day, San Francisco) Google Scholar
  • Schweitzer E. Perturbation theory and finite Markov chains. J. Appl. Probab. (1968) 5:401–413CrossrefGoogle Scholar
  • Serfozo R. An equivalence between continuous and discrete time Markov decision processes. Oper. Res. (1979) 27:616–620LinkGoogle Scholar
  • Stidham S. Optimal control to a queueing system. IEEE Trans. Automatic Control (1985) 30:705–713CrossrefGoogle Scholar
  • Tijms H.Stochastic Models, An Algorithmic Approach (1994) (John Wiley & Sons, New York) Google Scholar
  • Yushkevich A. Controlled Markov models with countable state and continuous time. Theory Probab. Its Appl. (1977) 22:215–235CrossrefGoogle Scholar
  • Zhang K., Xu Y., Chen X., Cao X. Policy iteration based feedback control. Automatica (2008) 44:1055–1061CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.