Discounted Continuous-Time Markov Decision Processes with Constraints: Unbounded Transition and Loss Rates

Published Online:https://doi.org/10.1287/moor.1100.0477

References

  • Altman E.Constrained Markov Decision Processes (1999) (Chapman & Hall/CRC, Boca Raton, FL) Google Scholar
  • Bertsekas D., Shreve S.Stochastic Optimal Control (1978) (Academic Press, New York) Google Scholar
  • Bertsekas D., Nedić A., Ozdağlar A. E.Convex Analysis and Optimization (2003) (Athena Scientific, Belmont, MA) Google Scholar
  • Brémaud P.Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues (1999) (Springer, New York) CrossrefGoogle Scholar
  • Ethier S. N., Kurtz T. G.Markov Processes; Characterization and Convergence (1986) (Wiley, New York) CrossrefGoogle Scholar
  • Feinberg E. Continuous time discounted jump Markov decision processes: A discrete-event approach. Math. Oper. Res. (2004) 29(3):492–524LinkGoogle Scholar
  • Goffman C., Pedrick G.First Course in Functional Analysis (1983) (Chelsea Publishing Company, New York) Google Scholar
  • Guo X. Continuous-time Markov decision processes with discounted rewards: The case of Polish spaces. Math. Oper. Res. (2007) 32(1):73–87LinkGoogle Scholar
  • Guo X., Hernández-Lerma O. Constrained continuous-time Markov controlled processes with discounted criteria. Stochastic Anal. Appl. (2003) 21(2):379–399CrossrefGoogle Scholar
  • Guo X., Hernández-Lerma O. Continuous-time controlled Markov chains with discounted rewards. Acta Appl. Math. (2003) 79(3):195–216CrossrefGoogle Scholar
  • Guo X., Hernández-Lerma O. Drift and monotonicity conditions for continuous-time controlled Markov chains with an average criterion. IEEE Trans. Automatic Control (2003) 48(2):236–245CrossrefGoogle Scholar
  • Guo X., Hernández-Lerma O., Prieto-Rumeau T. A survey of recent results on continuous-time Markov decision processes. TOP (2006) 14(2):177–257CrossrefGoogle Scholar
  • Hernández-Lerma O., Lasserre J. B.Discrete-Time Markov Control Processes (1996) (Springer, New York) CrossrefGoogle Scholar
  • Hernández-Lerma O., Lasserre J. B.Further Topics on Discrete-Time Markov Control Processes (1999) (Springer, New York) CrossrefGoogle Scholar
  • Jacod J. Multivariate point processes: Predictable projection, Radon-Nicodym derivatives, representation of martingales. Z. Wahrscheinlichkeitstheorie und verwandte Gebiete (1975) 31:235–253CrossrefGoogle Scholar
  • Kitaev M. Y. Semi-Markov and jump Markov controllable models: Average cost criterion. SIAM Theory Probab. Appl. (1985) 30:272–288CrossrefGoogle Scholar
  • Kitaev M. Y., Rykov V. V.Controlled Queueing Systems (1995) (CRC Press, New York) Google Scholar
  • Lasserre J. B. Moments and sums of squares for polynomial optimization and related problems. J. Global Optim. (2009) 45(1):39–61CrossrefGoogle Scholar
  • Liptzer R. S., Shiryaev A. N.Theory of Martingales (1989) (Kluwer, Dordrecht, The Netherlands) CrossrefGoogle Scholar
  • Meyer P. A.Probability and Potentials (1966) (Blaisdell Publishing Company, Waltham, MA) Google Scholar
  • Parthasarathy K. R.Probability Measures on Metric Spaces (2005) (AMS Chelsea Publishing, Providence, RI) Google Scholar
  • Piunovskiy A. B. Control of impulsive processes in problems with constraints. Automatic Remote Control (1994) 55:520–531Google Scholar
  • Piunovskiy A. B. The problem of convex programming with linear constraints. Computational Math. Math. Phys. (1994) 34(4):463–470Google Scholar
  • Piunovskiy A. B.Optimal Control of Random Sequences in Problems with Constraints (1997) (Kluwer Academic Publishers, Dordrecht, The Netherlands) CrossrefGoogle Scholar
  • Piunovskiy A. B. A controlled jump discounted model with constraints. Theory Probab. Appl. (1998) 42(1):51–72CrossrefGoogle Scholar
  • Piunovskiy A. B. Optimal interventions in countable jump Markov processes. Math. Oper. Res. (2004) 29:289–308LinkGoogle Scholar
  • Piunovskiy A. B. Discounted continuous time Markov decision processes: The convex analytic approach. Sixteenth Triennial IFAC World Congress (2005) Prague, Czech RepublicCrossrefGoogle Scholar
  • Prieto-Rumeau T., Hernández-Lerma O., Piunovskiy A. Policy iteration and finite approximations to discounted continuous-time controlled Markov chains. Modern Trends in Controlled Stochastic Processes (2010) (Luniver Press, Beckington, UK) 84–101Google Scholar
  • Puterman M. L.Markov Decision Processes (1994) (Wiley, New York) CrossrefGoogle Scholar
  • Rockafellar R. T.Conjugate Duality and Optimization (1989) (SIAM, Philadelphia) 45Google Scholar
  • Yan H., Zhang J., Guo X. Continuous-time Markov decision processes with unbounded transition and discounted-reward rates. Stochastic Anal. Appl. (2008) 26(2):209–231CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.