Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
Published Online:5 Sep 2012https://doi.org/10.1287/moor.1120.0555
References
- . Discrete-time controlled Markov processes with average cost criterion: A survey. SIAM J. Control Optim. (1993) 31(2):282–344Crossref, Google Scholar
- . Optimal decision procedures for finite Markov chains. Part I: Examples. Adv. Appl. Probab. (1973) 5(2):328–339Crossref, Google Scholar
- . Topological Spaces (1963) (Macmillan, New York) Google Scholar
- . Stochastic Optimal Control: The Discrete-Time Case (1996) (Athena Scientific, Belmont, MA) Google Scholar
- . Convergence of Probability Measures (1968) (John Wiley & Sons, New York) Google Scholar
- . Discrete dynamic programming. Ann. Math. Statist. (1962) 33(2):719–726Crossref, Google Scholar
- . A counterexample on the optimality equation in Markov decision chains with the average cost criterion. Systems and Control Lett. (1991) 16(5):387–392Crossref, Google Scholar
- . Compactness of the space of non-randomized policies in countable-state sequential decision processes. Math. Methods Oper. Res. (2010) 71(2):307–323Crossref, Google Scholar
- . A controlled finite Markov chain with an arbitrary set of decisions. Theor. Probab. Appl. (1975) 20(4):839–847Crossref, Google Scholar
- . On sequential decisions and Markov chains. Management Sci. (1962) 9(1):16–24Link, Google Scholar
- . Controlled Markov Processes (1979) (Springer-Verlag, New York) Crossref, Google Scholar
- . An ϵ-optimal control of a finite Markov chain. Theoret. Probab. Appl. (1980) 25(1):70–81Crossref, Google Scholar
- . Optimality of four-threshold policies in inventory systems with customer returns and borrowing/storage options. Probab. Engrg. Inform. Sci. (2004) 19(1):45–71Crossref, Google Scholar
- . Optimality inequalities for average cost Markov decision processes and the stochastic cash balance problem. Math. Oper. Res. (2007) 32(4):769–783Link, Google Scholar
- . Berge’s theorem for noncompact image sets. J. Math. Anal. Appl. (2012) . ForthcomingGoogle Scholar
- . Fatou’s lemma for weakly converging probabilities. (2012) . "http://arXiv:1206.4073v1Google Scholar
- . On controlled, discrete-time Markov decision processes. Theory Probab. Math. Statist. (1975) 7:47–61Google Scholar
- . Averege optimality in dynamic programming on Borel spaces—Unbounded costs and controls. Systems and Control Lett. (1991) 17(3):237–242Crossref, Google Scholar
- . Discrete-Time Markov Control Processes: Basic Optimality Criteria (1996) (Springer-Verlag, New York) Crossref, Google Scholar
- . Fatou’s lemma and Lebesgue’s convergence theorem for measures. J. Appl. Math. Stochastic Anal. (2000) 13(2):137–146Crossref, Google Scholar
- . Classical Descriptive Set Theory (1995) (Springer-Verlag, New York) Crossref, Google Scholar
- . A counterexample on the semicontinuity of minima. Proc. Amer. Math. Soc. (1995) 123(10):3175–3176Crossref, Google Scholar
- . Non-discounted denumerable Markovian decision model. Ann. Math. Statist. (1968) 39(2):412–424Crossref, Google Scholar
- . Arbitrary state Markovian decision processes. Ann. Math. Statist. (1968) 39(6):2118–2122Crossref, Google Scholar
- . On the nonexistence of ϵ-optimal randomized stationary policies in average cost Markov decision models. Ann. Math. Statist. (1971) 42(5):1767–1768Crossref, Google Scholar
- . Average optimality in dynamic programming with general state space. Math. Oper. Res. (1993) 18(1):163–172Link, Google Scholar
- . Stochastic Dynamic Programming and the Control of Queueing Systems (1999) (John Wiley & Sons, New York) Google Scholar
- , Feinberg EA, Shwartz A. Average reward optimization theory for denumerable state spaces. Handbook of Markov Decision Processes (2002) (Kluwer, Boston) 153–172Methods and ApplicationsCrossref, Google Scholar
- . Convergence of Lebesgue integrals with varying measures. Sankhya: The Indian Journal of Statistics (Series A) (1982) 44(3):380–402Google Scholar
- . Markovian sequential replacement processes. Ann. Math. Statist. (1965) 36(6):1677–1694Crossref, Google Scholar
- . On controls which reduce to optimal stationary regimes. Trudy Mat. Inst. Steklov. (1964) 71:35–45[In Russian; English translation: Report Number FTD-HT-67-69, National Technical Information Service, U.S. Department of Commerce]Google Scholar
- . Evolution Inclusions and Variation Inequalities for Earth Data Processing I (2011) (Springer, Berlin) Crossref, Google Scholar

