Average Optimality in Nonhomogeneous Infinite Horizon Markov Decision Processes

Published Online:https://doi.org/10.1287/moor.1100.0478

References

  • Alden J. M., Smith R. L. Rolling horizon procedures in nonhomogeneous Markov decision processes. Oper. Res. (1992) 40:S183–S194LinkGoogle Scholar
  • Aubin J.-P.Set-Valued Analysis (1990) (Birkhauser, Boston) Google Scholar
  • Bertsekas D., Shreve S.Stochastic Optimal Control: The Discrete Time Case (1978) (Academic Press, San Diego) Google Scholar
  • Derman C. Denumerable state Markovian decision processes average cost case. Ann. Math. Statist. (1966) 37:1545–1554CrossrefGoogle Scholar
  • Dynkin E., Yushkevich A. A.Controlled Markov Processes (1979) (Springer, Berlin) CrossrefGoogle Scholar
  • Federgruen A., Tijms H. C. The optimality equation in average cost denumerable state semi-Markov decision problems, recurrency conditions and algorithms. J. Appl. Probab. (1978) 15:356–373CrossrefGoogle Scholar
  • Feinberg E. Controlled Markov processes with arbitrary numerical criteria. SIAM Theory Probab. Appl. (1982) 25:486–503Google Scholar
  • Feinberg E., Shwartz A.Handbook of Markov Decision Processes: Methods and Algorithms (2002) (Kluwer, Boston) CrossrefGoogle Scholar
  • Goldberg R. R.Methods of Real Analysis (1964) (Blaisdell, Waltham, MA) 50Google Scholar
  • Guo X., Liu J., Liu K. Nonhomogeneous Markov decision processes with Borel state space—The average criterion with nonuniformly bounded rewards. Math. Oper. Res. (2000) 25:667–678LinkGoogle Scholar
  • Halkin H. Necessary conditions for optimal control problems with infinite horizons. Econometrica (1974) 42:267–272CrossrefGoogle Scholar
  • Hopp W. Nonhomogeneous Markov decision processes with applications to R&D planning. (1984) . Ph.D. dissertation, University of Michigan, Ann ArborGoogle Scholar
  • Hopp W., Bean J., Smith R. L. A new optimality criterion for nonhomogeneous Markov decision processes. Oper. Res. (1987) 35:875–883LinkGoogle Scholar
  • Kuratowski K.Topologie I, II (1966) (Academic Press, New York) Google Scholar
  • Munkres J. R.Topology: A First Course (1975) (Prentice-Hall, Englewood Cliffs, NJ) Google Scholar
  • Puterman M. L.Markov Decision Processes: Discrete Stochastic Dynamic Programming (1994) (Wiley, New York) CrossrefGoogle Scholar
  • Ross S. M. Non-discounted denumerable Markovian decision models. Ann. Math. Statist. (1968) 39:412–2423CrossrefGoogle Scholar
  • Ryan S. M., Bean J. C., Smith R. L. A tie-breaking rule for discrete infinite horizon optimization. Oper. Res. (1992) 40:S117–S126LinkGoogle Scholar
  • Schochetman I. E., Smith R. L. Convergence of selections with applications in optimization. J. Math. Anal. Appl. (1991) 155:278–292CrossrefGoogle Scholar
  • Schochetman I. E., Smith R. L. Existence and discovery of average optimal solutions in deterministic infinite horizon optimization. Math. Oper. Res. (1998) 20:416–432LinkGoogle Scholar
  • Seneta E.Non-Negative Matrices and Markov Chains (1981) (Springer-Verlag, New York) CrossrefGoogle Scholar
  • Tijms H. C.A First Course in Stochastic Models (2003) (Wiley, New York) CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.