Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Eugene A. Feinberg
Eugene A. Feinberg
[email protected]
Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794
Search for more papers by this author
,
Pavlo O. Kasyanov
Pavlo O. Kasyanov
[email protected]
Institute for Applied System Analysis, National Technical University of Ukraine “Kyiv Polytechnic Institute,” Kyiv, Ukraine
Search for more papers by this author
,
Nina V. Zadoianchuk
Nina V. Zadoianchuk
[email protected]
Institute for Applied System Analysis, National Technical University of Ukraine “Kyiv Polytechnic Institute,” Kyiv, Ukraine
Search for more papers by this author

Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794

Institute for Applied System Analysis, National Technical University of Ukraine “Kyiv Polytechnic Institute,” Kyiv, Ukraine

Search for more papers by this author

Nina V. Zadoianchuk

[email protected]

Institute for Applied System Analysis, National Technical University of Ukraine “Kyiv Polytechnic Institute,” Kyiv, Ukraine

Search for more papers by this author

Published Online:5 Sep 2012https://doi.org/10.1287/moor.1120.0555

Abstract

This paper presents sufficient conditions for the existence of stationary optimal policies for average cost Markov decision processes with Borel state and action sets and weakly continuous transition probabilities. The one-step cost functions may be unbounded, and the action sets may be noncompact. The main contributions of this paper are: (i) general sufficient conditions for the existence of stationary discount optimal and average cost optimal policies and descriptions of properties of value functions and sets of optimal actions, (ii) a sufficient condition for the average cost optimality of a stationary policy in the form of optimality inequalities, and (iii) approximations of average cost optimal actions by discount optimal actions.

Cited by
- When to Push Ads: Optimal Mobile Ad Campaign Strategy Under Markov Customer Dynamics
  Guokai Li,
  Pin Gao,
  Zizhuo Wang
  11 May 2026 | Manufacturing & Service Operations Management, Vol. 0, No. 0
- Kernel mean embedding topology: Weak and strong forms for stochastic kernels and implications for model learning
  The Annals of Applied Probability, Vol. 36, No. 2
- Інтегровані дослідження в галузі штучного інтелекту, математичного моделювання та оборонних застосувань
  22 December 2025 | Visnik Nacional noi academii nauk Ukrai ni, No. 12
- On Average Optimality for Non-Stationary Markov Decision Processes in Borel Spaces
  Xin Guo,
  Yonghui Huang,
  Yi Zhang
  7 October 2024 | Mathematics of Operations Research, Vol. 50, No. 4
- An optimal sequence for sub-Markov decision processes with risk sensitivity
  Systems & Control Letters, Vol. 205
- Optimality of Base-Stock Policy Under Unknown General Demand Distributions: New Methods, New Results, and Computations
  9 June 2025 | Production and Operations Management, Vol. 34, No. 11
- Acute angle lemma for noncompact image sets
  18 July 2025 | Journal of Fixed Point Theory and Applications, Vol. 27, No. 3
- A Mean-Field Approach for Ergodic Nonzero-Sum Stochastic Games in a System of Interacting Objects with Additive Costs
  14 August 2025 | Dynamic Games and Applications, Vol. 45
- Continuity of Filters for Discrete-Time Control Problems Defined by Explicit Equations
  12 May 2025 | SIAM Journal on Control and Optimization, Vol. 63, No. 3
- Another Look at Partially Observed Optimal Stochastic Control: Existence, Ergodicity, and Approximations Without Belief-Reduction
  4 January 2025 | Applied Mathematics & Optimization, Vol. 91, No. 1
- Near Optimal Approximations and Finite Memory Policies for POMPDs with Continuous Spaces
  19 March 2025 | Journal of Systems Science and Complexity, Vol. 38, No. 1
- AI Methodology for Modeling Protein Interactions in Biological Systems
  26 April 2025 | Cybernetics and Systems Analysis, Vol. 61, No. 1
- AI METHODOLOGY FOR MODELING PROTEIN INTERACTIONS IN BIOLOGICAL SYSTEMS
  1 January 2025 | KIBERNETYKA TA SYSTEMNYI ANALIZ
- Sufficient Conditions for Solving Statistical Filtering Problems by Dynamic Programming
- The principle of optimality in dynamic programming: A pedagogical note
  Operations Research Letters, Vol. 57
- Discounted cost exponential semi-Markov decision processes with unbounded transition rates: a service rate control problem with impatient customers
  18 April 2024 | Probability in the Engineering and Informational Sciences, Vol. 38, No. 4
- Relative Q-Learning for Average-Reward Markov Decision Processes With Continuous States
  IEEE Transactions on Automatic Control, Vol. 69, No. 10
- Asymptotic Optimality of Constant-Order Policies in Joint Pricing and Inventory Models
  Xin Chen,
  Alexander L. Stolyar,
  Linwei Xin
  17 April 2023 | Mathematics of Operations Research, Vol. 49, No. 1
- Reinforcement Learning for Partially Observable Models
  2 February 2024
- Convergence of Finite Memory Q Learning for POMDPs and Near Optimality of Learned Policies Under Filter Stability
  Ali Devran Kara,
  Serdar Yüksel
  21 November 2022 | Mathematics of Operations Research, Vol. 48, No. 4
- Asymptotic Optimality of Semi-Open-Loop Policies in Markov Decision Processes with Large Lead Times
  Xingyu Bai,
  Xin Chen,
  Menglong Li,
  Alexander Stolyar
  15 June 2023 | Operations Research, Vol. 71, No. 6
- Technical Note—Average Cost Optimality in Partially Observable Lost-Sales Inventory Systems
  Xingyu Bai,
  Xin Chen,
  Alexander L. Stolyar
  10 June 2022 | Operations Research, Vol. 71, No. 6
- Formalization of Methods for the Development of Autonomous Artificial Intelligence Systems
  7 October 2023 | Cybernetics and Systems Analysis, Vol. 59, No. 5
- Zeroes of Multifunctions with Noncompact Image Sets
  14 September 2023 | Axioms, Vol. 12, No. 9
- Continuity of discounted values and the structure of optimal policies for periodic‐review inventory systems with setup costs
  29 March 2023 | Naval Research Logistics (NRL), Vol. 70, No. 5
- Optimization of Generalized Certainty Equivalents on the Finite Horizon
  26 June 2023 | Journal of Materials and Mechatronics: A, Vol. 4, No. 1
- Hybrid Radio Resource Management Based on Multi-Agent Reinforcement Learning
- A survey of average cost problems in deterministic discrete-time control systems
  Journal of Mathematical Analysis and Applications, Vol. 522, No. 1
- A note on the existence of optimal stationary policies for average Markov decision processes with countable states
  Automatica, Vol. 151
- Compactly Restrictable Metric Policy Optimization Problems
  IEEE Transactions on Automatic Control, Vol. 68, No. 5
- Formalization and Development of Autonomous Artificial Intelligence Systems
  29 August 2023
- Convex analytic method revisited: Further optimality results and performance of deterministic policies in average cost stochastic control
  Journal of Mathematical Analysis and Applications, Vol. 517, No. 2
- When to Push Ads: Optimal Mobile Ad Campaign Strategy under Markov Customer Dynamics
  1 January 2023 | SSRN Electronic Journal, Vol. 60
- Near Optimality of Finite Memory Policies for POMPDs with Continuous Spaces
- Efficient Online Learning Based Cross-Tier Uplink Scheduling in HetNets
  IEEE/ACM Transactions on Networking, Vol. 30, No. 6
- Structure of optimal policies to periodic-review inventory models with convex costs and backorders for all values of discount factors
  17 June 2017 | Annals of Operations Research, Vol. 317, No. 1
- On the optimality equation for average cost Markov decision processes and its validity for inventory control
  22 June 2017 | Annals of Operations Research, Vol. 317, No. 2
- Continuity of equilibria for two-person zero-sum games with noncompact action sets and unbounded payoffs
  5 October 2017 | Annals of Operations Research, Vol. 317, No. 2
- LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems
  Journal of Mathematical Analysis and Applications, Vol. 512, No. 1
- Markov Decision Processes with Incomplete Information and Semiuniform Feller Transition Probabilities
  22 August 2022 | SIAM Journal on Control and Optimization, Vol. 60, No. 4
- On structural properties of optimal average cost functions in Markov decision processes with Borel spaces and universally measurable policies
  Journal of Mathematical Analysis and Applications, Vol. 509, No. 1
- Unbounded dynamic programming via the Q-transform
  Journal of Mathematical Economics, Vol. 100
- Learning While Repositioning in On-Demand Vehicle Sharing Networks
  SSRN Electronic Journal, Vol. 63
- MDPs with setwise continuous transition probabilities
  Operations Research Letters, Vol. 49, No. 5
- Learning to Schedule Network Resources Throughput and Delay Optimally Using Q + -Learning
  IEEE/ACM Transactions on Networking, Vol. 29, No. 2
- Average Cost Markov Decision Processes with Semi-Uniform Feller Transition Probabilities
  5 June 2021
- A useful technique for piecewise deterministic Markov decision processes
  Operations Research Letters, Vol. 49, No. 1
- STOCHASTIC SETUP-COST INVENTORY MODEL WITH BACKORDERS AND QUASICONVEX COST FUNCTIONS
  4 April 2019 | Probability in the Engineering and Informational Sciences, Vol. 34, No. 3
- Asymptotic Optimality of Finite Model Approximations for Partially Observed Markov Decision Processes With Discounted Cost
  IEEE Transactions on Automatic Control, Vol. 65, No. 1
- Structural Results for Average‐Cost Inventory Models with Markov‐Modulated Demand and Partial Information
  1 January 2020 | Production and Operations Management, Vol. 29, No. 1
- Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies
  24 August 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 4
- On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
  10 March 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 2
- Fatou's Lemma for Weakly Converging Measures under the Uniform Integrability Condition
  13 February 2020 | Theory of Probability & Its Applications, Vol. 64, No. 4
- Fatou's Lemma in Its Classical Form and Lebesgue's Convergence Theorems for Varying Measures with Applications to Markov Decision Processes
  5 August 2020 | Theory of Probability & Its Applications, Vol. 65, No. 2
- Asymptotic Optimality of Semi-Open-Loop Policies in Markov Decision Processes with Large Lead Times
  SSRN Electronic Journal, Vol. 63
- Лемма Фату в классической форме и теоремы Лебега о сходимости для последовательности мер с приложениями к управляемым марковским процессам
  22 April 2020 | Теория вероятностей и ее применения, Vol. 65, No. 2
- Beyond Max-weight Scheduling: A Reinforcement Learning-based Approach
- AoI-Penalty Minimization for Networked Control Systems with Packet Loss
- On the reduction of total‐cost and average‐cost MDPs to discounted MDPs
  25 May 2017 | Naval Research Logistics (NRL), Vol. 66, No. 1
- Fatou's lemma for weakly converging measures under the uniform integrability condition
  22 October 2019 | Теория вероятностей и ее применения, Vol. 64, No. 4
- On the convergence of optimal actions for Markov decision processes and the optimality of ( s , S ) inventory policies
  11 August 2017 | Naval Research Logistics (NRL), Vol. 65, No. 8
- Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: The fixed-point approach revisited
  Journal of Mathematical Analysis and Applications, Vol. 464, No. 1
- Planning for the long run: Programming with patient, Pareto responsive preferences
  Journal of Economic Theory, Vol. 176
- Learning based Utility Maximization for Multi-Resource Management
  20 June 2018
- Online Learning based Uplink Scheduling in HetNets with Limited Backhaul Capacity
- Reduction of total-cost and average-cost MDPs with weakly continuous transition probabilities to discounted MDPs
  Operations Research Letters, Vol. 46, No. 2
- Indirect Lyapunov Method for Autonomous Dynamical Systems
  12 July 2017
- Introduction and Summary
  12 May 2018
- Prelude to Part I
  12 May 2018
- Infinite Horizon Optimal Transmission Power Control for Remote State Estimation Over Fading Channels
  IEEE Transactions on Automatic Control, Vol. 63, No. 1
- On the average-cost optimality equations and convergence of discounted-cost relative value functions for inventory control problems with quasiconvex cost functions
- On the Asymptotic Optimality of Finite Approximations to Markov Decision Processes with Borel Spaces
  Naci Saldi,
  Serdar Yüksel,
  Tamás Linder
  15 March 2017 | Mathematics of Operations Research, Vol. 42, No. 4
- Method of Artificial Control and the 3D Navier-Stokes System
  7 December 2017
- Constrained Markov decision processes in Borel spaces: from discounted to average optimality
  20 June 2016 | Mathematical Methods of Operations Research, Vol. 84, No. 3
- Uniform Fatou's lemma
  Journal of Mathematical Analysis and Applications, Vol. 444, No. 1
- Optimality Conditions for Inventory Control
  Eugene A. Feinberg
  4 November 2016
- Structure of Optimal Solutions to Periodic-Review Total-Cost Stochastic Inventory Control Problems
  29 September 2016 | ACM SIGMETRICS Performance Evaluation Review, Vol. 44, No. 2
- Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
  Eugene A. Feinberg,
  Pavlo O. Kasyanov,
  Michael Z. Zgurovsky
  22 January 2016 | Mathematics of Operations Research, Vol. 41, No. 2
- Near optimality of quantized policies in stochastic control under weak continuity conditions
  Journal of Mathematical Analysis and Applications, Vol. 435, No. 1
- Finite-state approximation of Markov decision processes with unbounded costs and Borel spaces
- A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
  Huizhen Yu,
  Dimitri P. Bertsekas
  1 April 2015 | Mathematics of Operations Research, Vol. 40, No. 4
- Continuity of Minima: Local Results
  8 February 2015 | Set-Valued and Variational Analysis, Vol. 23, No. 3
- Finite state approximations of Markov decision processes with general state and action spaces
- On the vanishing discount factor approach for Markov decision processes with weakly continuous transition probabilities
  Journal of Mathematical Analysis and Applications, Vol. 426, No. 2
- Asymptotic Optimality and Rates of Convergence of Quantized Stationary Policies in Stochastic Control
  IEEE Transactions on Automatic Control, Vol. 60, No. 2
- On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes
  SIAM Journal on Control and Optimization, Vol. 53, No. 4
- Examples concerning Abel and Cesàro limits
  Journal of Mathematical Analysis and Applications, Vol. 420, No. 2
- Convergence of value iterations for total-cost MDPs and POMDPs with general state and action sets
- Optimal stabilizing controllers for discrete-time linear systems with Markovian jumping parameters under state measurements
- Convergence of probability measures and Markov decision models with incomplete information
  24 January 2015 | Proceedings of the Steklov Institute of Mathematics, Vol. 287, No. 1
- Average Optimality for Continuous-Time Markov Decision Processes Under Weak Continuity Conditions
  30 January 2018 | Journal of Applied Probability, Vol. 51, No. 4
- Bergeʼs maximum theorem for noncompact image sets
  Journal of Mathematical Analysis and Applications, Vol. 413, No. 2
- More Risk-Sensitive Markov Decision Processes
  Nicole Bäuerle,
  Ulrich Rieder
  17 June 2013 | Mathematics of Operations Research, Vol. 39, No. 1
- Optimality Conditions for Partially Observable Markov Decision Processes
  26 November 2013
- Fatou's Lemma for Weakly Converging Probabilities
  Theory of Probability & Its Applications, Vol. 58, No. 4
- Optimality conditions for total-cost Partially Observable Markov Decision Processes
- Berge’s theorem for noncompact image sets
  Journal of Mathematical Analysis and Applications, Vol. 397, No. 1
- Fatou's lemma for weakly converging probabilities
  Теория вероятностей и ее применения, Vol. 58, No. 4

cover image Mathematics of Operations Research

Volume 37, Issue 4

November 2012

Pages 559-674

Article Information

Metrics

Information

Received:February 16, 2012
Published Online:September 05, 2012

Cite as

Eugene A. Feinberg, Pavlo O. Kasyanov, Nina V. Zadoianchuk, (2012) Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities. Mathematics of Operations Research 37(4):591-607.

https://doi.org/10.1287/moor.1120.0555

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Abstract

Volume 37, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News