An Approximation Approach for the Deviation Matrix of Continuous-Time Markov Processes with Application to Markov Decision Theory
Published Online:26 Feb 2010https://doi.org/10.1287/opre.1090.0786
References
- , Feinberg E., Shwartz A. Applications of Markov decision processes in communication networks. Handbook of Markov Decision Processes (2002) (Kluwer, Boston) 489–536Crossref, Google Scholar
- Zero-sum Markov games and worst-case optimal control of queueing systems. Queueing Systems (1995) 21:415–447Crossref, Google Scholar
- Discrete-Event Control of Stochastic Networks: Multimodularity and Regularity (2003) 1829(Springer, New York) Springer Lecture Notes in MathematicsCrossref, Google Scholar
- Contraction conditions for average and α-discount optimality in countable state Markov games with unbounded rewards. Math. Oper. Res. (1997) 22:588–618Link, Google Scholar
- Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM J. Control Optim. (1993) 31:282–344Crossref, Google Scholar
- Applied Probability and Queues (1987) (Springer, New York) Google Scholar
- Optimal stationary policies for denumerable Markov chains in continuous time. Adv. Appl. Probab. (1976) 8:144–158Crossref, Google Scholar
- Dynamic Programming and Optimal Control (2005) 3rd ed.(Athena Scientific, Nashua, NH) Google Scholar
- Statistical analysis of a telephone call center: A queueing-science perspective. J. Amer. Statist. Assoc. (2005) 469:36–50Crossref, Google Scholar
- Stochastic Learning and Optimization: A Sensitivity-Based Approach (2007) (Springer, New York) Crossref, Google Scholar
- Analysis of multi-server queues with station and server vacations. Eur. J. Oper. Res. (1997) 110:392–406Crossref, Google Scholar
- The deviation matrix of a continuous-time Markov chain. Probab. Engrg. Informational Sci. (2002) 16:351–366Crossref, Google Scholar
- Average, sensitive and Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards. Math. Oper. Res. (1988) 13:395–420Link, Google Scholar
- Denumerable semi-Markov decision chains with small interest rates. Ann. Oper. Res. (1991) 28:185–212Crossref, Google Scholar
- Recurrence conditions for average and Blackwell optimality in denumerable state Markov decision chains. Math. Oper. Res. (1992) 17:271–289Link, Google Scholar
- On the relation between recurrence and ergodicity properties in denumerable Markov decision chains. Math. Oper. Res. (1994) 19:539–559Link, Google Scholar
- Controlled Markov Processes (1979) (Springer-Verlag, New York) Crossref, Google Scholar
- Handbook of Markov Decision Processes: Methods and Applications (2002) (Kluwer, Boston) Crossref, Google Scholar
- Continuous-time Markov decision processes with discounted rewards: The case of Polish spaces. Math. Oper. Res. (2007) 32:73–87Link, Google Scholar
- Optimal control of ergodic continuous-time Markov chains with average sample-path rewards. SIAM J. Control Optim. (2005) 44:29–48Crossref, Google Scholar
- Continuous-time controlled Markov chains with discounted rewards. Acta Appl. Math. (2003) 79:195–216Crossref, Google Scholar
- A note on optimality conditions for continuous-time Markov decision processes with average cost criterion. IEEE Trans. Automatic Control (2001) 46:1984–1989Crossref, Google Scholar
- A survey of recent results on continuous-time Markov decision processes. Sociedad de Estadistica e Investigación Operativa TOP (2006) 14:177–261Google Scholar
- Series expansions for continuous-time Markov processes. Oper. Res. (2009) . ePub ahead of print October 28, http://or.journal.informs.org/cgi/content/abstract/opre.1090.0738v1Google Scholar
- Series expansions for finite-state Markov chains. Probab. Engrg. Informational Sci. (2007) 21:381–400Crossref, Google Scholar
- Constrained admission control to a queueing system. Adv. Appl. Probab. (1989) 21:409–431Crossref, Google Scholar
- Average optimal policies in Markov decision drift processes with applications to a queueing and a replacement model. Adv. Appl. Probab. (1983) 15:274–303Crossref, Google Scholar
- Blackwell optimality in the class of all policies in Markov decision chains with a Borel state and unbounded rewards. Math. Methods Oper. Res. (1999) 50:421–448Crossref, Google Scholar
- , Feinberg E., Shwartz A. Blackwell optimality. Handbook of Markov Decision Processes (2002) (Kluwer, Boston) 231–267Crossref, Google Scholar
- Continuous time Markov decision processes with average return criterion. J. Math. Anal. Appl. (1975) 52:173–188Crossref, Google Scholar
- Markov Processes for Stochastic Modeling (1997) (Chapman & Hall, London) Crossref, Google Scholar
- The deviation matrix of the M/M/1/∞ and M/M/1/N queue, with application to controlled queueing models. Proc. 37th IEEE CDC (1998) (IEEE Press, Tampa, FL) 56–59Crossref, Google Scholar
- Matrix-Geometric Solutions in Stochastic Models—An Algorithmic Approach (1994) (Constable and Company, London) Google Scholar
- The EMpht-programme. Manual (1998) (Gothenburg University, Gothenburg, Sweden) Google Scholar
- Markov Decision Processes (1994) (John Wiley & Sons, New York) Crossref, Google Scholar
- M/G/1-type Markov processes: A tutorial. Performance Evaluation of Complex Systems: Techniques and Tools (2002) 2459:315–325Google Scholar
- Applied Probability Models with Optimization Applications (1970) (Holden-Day, San Francisco) Google Scholar
- Perturbation theory and finite Markov chains. J. Appl. Probab. (1968) 5:401–413Crossref, Google Scholar
- An equivalence between continuous and discrete time Markov decision processes. Oper. Res. (1979) 27:616–620Link, Google Scholar
- Optimal control to a queueing system. IEEE Trans. Automatic Control (1985) 30:705–713Crossref, Google Scholar
- Stochastic Models, An Algorithmic Approach (1994) (John Wiley & Sons, New York) Google Scholar
- Controlled Markov models with countable state and continuous time. Theory Probab. Its Appl. (1977) 22:215–235Crossref, Google Scholar
- Policy iteration based feedback control. Automatica (2008) 44:1055–1061Crossref, Google Scholar

