Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Eugene A. Feinberg
[email protected]
Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794
Pavlo O. Kasyanov
[email protected]
Institute for Applied System Analysis, National Technical University of Ukraine “Kyiv Polytechnic Institute,” Kyiv, Ukraine
Nina V. Zadoianchuk
[email protected]
Institute for Applied System Analysis, National Technical University of Ukraine “Kyiv Polytechnic Institute,” Kyiv, Ukraine

Published Online: 5 Sep 2012, https://doi.org/10.1287/moor.1120.0555

References

Arapostathis A, Borkar VS, Fernandez-Gaucherand E, Ghosh MK, Marcus SI. Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM J. Control Optim. (1993) 31(2):282–344.
Bather J. Optimal decision procedures for finite Markov chains. Part I: Examples. Adv. Appl. Probab. (1973) 5(2):328–339.
Berge C. Topological Spaces (1963) (Macmillan, New York).
Bertsekas DP, Shreve SE. Stochastic Optimal Control: The Discrete-Time Case (1996) (Athena Scientific, Belmont, MA).
Billingsley P. Convergence of Probability Measures (1968) (John Wiley & Sons, New York).
Blackwell D. Discrete dynamic programming. Ann. Math. Statist. (1962) 33(2):719–726.
Cavazos-Cadena R. A counterexample on the optimality equation in Markov decision chains with the average cost criterion. Systems and Control Lett. (1991) 16(5):387–392.
Chen RC, Feinberg EA. Compactness of the space of non-randomized policies in countable-state sequential decision processes. Math. Methods Oper. Res. (2010) 71(2):307–323.
Chitashvili RY. A controlled finite Markov chain with an arbitrary set of decisions. Theory Probab. Appl. (1975) 20(4):839–847.
Derman C. On sequential decisions and Markov chains. Management Sci. (1962) 9(1):16–24.
Dynkin EB, Yushkevich AA. Controlled Markov Processes (1979) (Springer-Verlag, New York).
Feinberg EA. An ϵ-optimal control of a finite Markov chain. Theory Probab. Appl. (1980) 25(1):70–81.
Feinberg EA, Lewis ME. Optimality of four-threshold policies in inventory systems with customer returns and borrowing/storage options. Probab. Engrg. Inform. Sci. (2004) 19(1):45–71.
Feinberg EA, Lewis ME. Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem. Math. Oper. Res. (2007) 32(4):769–783.
Feinberg EA, Kasyanov PO, Zadoianchuk NV. Berge’s theorem for noncompact image sets. J. Math. Anal. Appl. (2012). Forthcoming.
Feinberg EA, Kasyanov PO, Zadoianchuk NV. Fatou’s lemma for weakly converging probabilities. (2012). arXiv:1206.4073v1.
Gubenko LG, Shtatland ES. On controlled, discrete-time Markov decision processes. Theory Probab. Math. Statist. (1975) 7:47–61.
Hernández-Lerma O. Average optimality in dynamic programming on Borel spaces—Unbounded costs and controls. Systems and Control Lett. (1991) 17(3):237–242.
Hernández-Lerma O, Lasserre JB. Discrete-Time Markov Control Processes: Basic Optimality Criteria (1996) (Springer-Verlag, New York).
Hernández-Lerma O, Lasserre JB. Fatou’s lemma and Lebesgue’s convergence theorem for measures. J. Appl. Math. Stochastic Anal. (2000) 13(2):137–146.
Kechris AS. Classical Descriptive Set Theory (1995) (Springer-Verlag, New York).
Luque-Vásquez F, Hernández-Lerma O. A counterexample on the semicontinuity of minima. Proc. Amer. Math. Soc. (1995) 123(10):3175–3176.
Ross SM. Non-discounted denumerable Markovian decision model. Ann. Math. Statist. (1968) 39(2):412–424.
Ross SM. Arbitrary state Markovian decision processes. Ann. Math. Statist. (1968) 39(6):2118–2122.
Ross SM. On the nonexistence of ϵ-optimal randomized stationary policies in average cost Markov decision models. Ann. Math. Statist. (1971) 42(5):1767–1768.
Schäl M. Average optimality in dynamic programming with general state space. Math. Oper. Res. (1993) 18(1):163–172.
Sennott LI. Stochastic Dynamic Programming and the Control of Queueing Systems (1999) (John Wiley & Sons, New York).
Sennott LI, Feinberg EA, Shwartz A. Average reward optimization theory for denumerable state spaces. Handbook of Markov Decision Processes: Methods and Applications (2002) (Kluwer, Boston) 153–172.
Serfozo R. Convergence of Lebesgue integrals with varying measures. Sankhya: The Indian Journal of Statistics (Series A) (1982) 44(3):380–402.
Taylor HM. Markovian sequential replacement processes. Ann. Math. Statist. (1965) 36(6):1677–1694.
Viskov OV, Shiryaev AN. On controls which reduce to optimal stationary regimes. Trudy Mat. Inst. Steklov. (1964) 71:35–45 [In Russian; English translation: Report Number FTD-HT-67-69, National Technical Information Service, U.S. Department of Commerce].
Zgurovsky MZ, Mel’nik VS, Kasyanov PO. Evolution Inclusions and Variation Inequalities for Earth Data Processing I (2011) (Springer, Berlin).

Mathematics of Operations Research

Volume 37, Issue 4

November 2012

Pages 559-674

Article Information

Received: February 16, 2012
Published Online: September 05, 2012

Cite as

Eugene A. Feinberg, Pavlo O. Kasyanov, Nina V. Zadoianchuk (2012) Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities. Mathematics of Operations Research 37(4):591-607.

https://doi.org/10.1287/moor.1120.0555
