Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs

Linn I. Sennott
Linn I. Sennott
Illinois State University, Normal, Illinois
Search for more papers by this author

Illinois State University, Normal, Illinois

Published Online:1 Aug 1989https://doi.org/10.1287/opre.37.4.626

Abstract

We deal with infinite state Markov decision processes with unbounded costs. Three simple conditions, based on the optimal discounted value function, guarantee the existence of an expected average cost optimal stationary policy. Sufficient conditions are the existence of a distinguished state of smallest discounted value and a single stationary policy inducing an irreducible, ergodic Markov chain for which the average cost of a first passage from any state to the distinguished state is finite. A result to verify this is also given. Two examples illustrate the ease of applying the criteria.

Cited by
- Index-Based Scheduling for a Resource-Constrained Quantum Switch
- Optimal Preemption Policy for Age of Information Minimization with Random Arrival and Known Packet Length
  23 February 2026 | ACM Transactions on Sensor Networks, Vol. 22, No. 2
- Utility-Oriented Rate-Splitting Multiple Access for Multi-Type Services in Satellite-Integrated Internet
  IEEE Transactions on Communications, Vol. 74
- Preempting to Minimize Age of Incorrect Information Under Transmission Delay
  IEEE Transactions on Networking, Vol. 34
- Optimizing AoI at Query in Multiuser Wireless Uplink Networks: A Whittle Index Approach
  IEEE Transactions on Communications, Vol. 73, No. 11
- Utility Loss of Information Minimization for Semantic-Empowered Satellite-Integrated Internet
  IEEE Transactions on Communications, Vol. 73, No. 11
- Equidistant-Sample or Wait-and-Sample to Minimize Age Under Sampling Constraint?
- A Novel Switch-Type Policy Network for Resource Allocation Problems
- Tracking and Assigning Jobs to a Markov Machine
- Structure-Aware Reinforcement Learning for Optimal Transmission Scheduling Over Packet Length-Dependent Lossy Networks
  IEEE Transactions on Automatic Control, Vol. 70, No. 4
- A Low-Delay MAC for IoT Applications: Decentralized Optimal Scheduling of Queues Without Explicit State Information Sharing
  IEEE Transactions on Control of Network Systems, Vol. 12, No. 1
- Adaptive fault detection under radio frequency energy-harvesting mechanism
  Discrete and Continuous Dynamical Systems - B, Vol. 30, No. <![CDATA[10]]>
- Timely Offloading in Mobile Edge Cloud Systems
- Minimizing Age of Incorrect Information Over a Channel With Random Delay
  IEEE/ACM Transactions on Networking, Vol. 32, No. 4
- Efficient and Timely Memory Access
- Coexistence Between Task- and Data-Oriented Communications: A Whittle’s Index Guided Multiagent Reinforcement Learning Approach
  IEEE Internet of Things Journal, Vol. 11, No. 2
- Linear Quadratic Zero-Sum Differential Games With Intermittent and Costly Sensing
  IEEE Control Systems Letters, Vol. 8
- Optimal Control and Energy Scheduling for Linear-Quadratic-Gaussian System with Energy Harvesting Constraints
  Pure Mathematics, Vol. 14, No. 04
- Age-Optimal Scheduling Over Hybrid Channels
  IEEE Transactions on Mobile Computing, Vol. 22, No. 12
- Minimizing the Age of Information Over an Erasure Channel for Random Packet Arrivals With a Storage Option at the Transmitter
- Age Minimization with Energy and Distortion Constraints
  16 October 2023
- Optimal Preemption Policy for Age of Information Minimization with Known Packet Length
- Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach
- To Re-Transmit or Not to Re-Transmit for Freshness
- Age-Optimal Low-Power Status Update Over Time-Correlated Fading Channel
  IEEE Transactions on Mobile Computing, Vol. 22, No. 8
- Minimizing Age of Information in Downlink Wireless Networks With Time-Varying Channels and Peak Power Constraint
  IEEE Transactions on Vehicular Technology, Vol. 72, No. 7
- Timely Monitoring of Dynamic Sources With Observations From Multiple Wireless Sensors
  IEEE/ACM Transactions on Networking, Vol. 31, No. 3
- Optimal Scheduling in General Multi-Queue System by Combining Simulation and Neural Network Techniques
  10 June 2023 | Sensors, Vol. 23, No. 12
- A note on the existence of optimal stationary policies for average Markov decision processes with countable states
  Automatica, Vol. 151
- Optimal Sampling for Data Freshness: Unreliable Transmissions With Random Two-Way Delay
  IEEE/ACM Transactions on Networking, Vol. 31, No. 1
- Weighted Age of Information-Based Scheduling for Large Population Games on Networks
  IEEE Journal on Selected Areas in Information Theory, Vol. 4
- Joint Schedule of Task- and Data-Oriented Communications
- Optimal control for parallel queues with a single batch server
  Operations Research Letters, Vol. 50, No. 4
- Optimal Information Update for Energy Harvesting Sensor with Reliable Backup Energy
  11 July 2022 | Entropy, Vol. 24, No. 7
- Optimal Update for Energy Harvesting Sensor with Reliable Backup Energy
- Optimizing Sampling for Data Freshness: Unreliable Transmissions with Random Two-way Delay
- Sleep, Sense or Transmit: Energy-Age Tradeoff for Status Update With Two-Threshold Optimal Policy
  IEEE Transactions on Wireless Communications, Vol. 21, No. 3
- Dispatching to parallel servers
  11 October 2021 | Queueing Systems, Vol. 99, No. 3-4
- Essential Features in a Theory of Context for Enabling Artificial General Intelligence
  16 December 2021 | Applied Sciences, Vol. 11, No. 24
- Optimal Open-Loop Routing and Threshold-Based Allocation in TWO Parallel QUEUEING Systems with Heterogeneous Servers
  31 October 2021 | Mathematics, Vol. 9, No. 21
- A UoI-Optimal Policy for Timely Status Updates with Resource Constraint
  20 August 2021 | Entropy, Vol. 23, No. 8
- Minimizing Age of Information via Scheduling over Heterogeneous Channels
  26 July 2021
- Battle between Rate and Error in Minimizing Age of Information
  26 July 2021
- Channel Quality-Based Optimal Status Update for Information Freshness in Internet of Things
  18 July 2021 | Entropy, Vol. 23, No. 7
- Optimal Sampling and Scheduling for Timely Status Updates in Multi-Source Networks
  IEEE Transactions on Information Theory, Vol. 67, No. 6
- Optimal Transmit Power Allocation for an Energy-Harvesting Sensor in Wireless Cyber-Physical Systems
  IEEE Transactions on Cybernetics, Vol. 51, No. 2
- Scheduling Algorithms for Minimizing Age of Information in Wireless Broadcast Networks with Random Arrivals
  IEEE Transactions on Mobile Computing, Vol. 19, No. 12
- Optimal Control-Aware Transmission for Mission-Critical M2M Communications Under Bandwidth Cost Constraints
  IEEE Transactions on Communications, Vol. 68, No. 9
- Age-Optimal Scheduling for Heterogeneous Traffic with Timely-Throughput Constraint
- Age-of-Information Aware Scheduling under Markovian Energy Arrivals
- Minimizing Age of Information With Power Constraints: Multi-User Opportunistic Scheduling in Multi-State Time-Varying Channels
  IEEE Journal on Selected Areas in Communications, Vol. 38, No. 5
- Algorithmic Analysis of a Two-Class Multi-server Heterogeneous Queueing System with a Controllable Cross-connectivity
  8 November 2020
- Evaluation and Prediction of an Optimal Control in a Processor Sharing Queueing System with Heterogeneous Servers
  2 January 2021
- Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies
  24 August 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 4
- On the Minimum Pair Approach for Average Cost Markov Decision Processes with Countable Discrete Action Spaces and Strictly Unbounded Costs
  10 March 2020 | SIAM Journal on Control and Optimization, Vol. 58, No. 2
- Age-Optimal Transmission of Rateless Codes in an Erasure Channel
- Optimizing Information Freshness in Broadcast Network with Unreliable Links and Random Arrivals: An Approximate Index Policy
- Only Those Requested Count: Proactive Scheduling Policies for Minimizing Effective Age-of-Information
- Average Age of Information With Hybrid ARQ Under a Resource Constraint
  IEEE Transactions on Wireless Communications, Vol. 18, No. 3
- Can Decentralized Status Update Achieve Universally Near-Optimal Age-of-Information in Wireless Multiaccess Channels?
- Solutions of the average cost optimality equation for Markov decision processes with weakly continuous kernel: The fixed-point approach revisited
  Journal of Mathematical Analysis and Applications, Vol. 464, No. 1
- Delay-Optimal Scheduling for Heterogeneous Users in NOMA Networks
- Age of Information: Whittle Index for Scheduling Stochastic Arrivals
- Average age of information with hybrid ARQ under a resource constraint
- Asymptotic Optimality of Tailored Base-Surge Policies in Dual-Sourcing Inventory Systems
  Linwei Xin,
  David A. Goldberg
  7 February 2017 | Management Science, Vol. 64, No. 1
- Age of information: Design and analysis of optimal scheduling algorithms
- Structures of Optimal Policies in MDPs with Unbounded Jumps: The State of Our Art
  11 March 2017
- Production control policies to maintain service levels in different seasons
  Journal of Manufacturing Systems, Vol. 41
- Hybrid MAC Protocols for Low-Delay Scheduling
- Optimal control of a multiclass queueing system when customers can change types
  28 November 2015 | Queueing Systems, Vol. 82, No. 3-4
- Containment of socially optimal policies in multiple-facility Markovian queueing systems
  21 December 2017 | Journal of the Operational Research Society, Vol. 67, No. 4
- Opportunities for Network Coding: To Wait or Not to Wait
  IEEE/ACM Transactions on Networking, Vol. 23, No. 6
- Value of capacity flexibility in manufacturing systems with seasonal demands
  13 January 2015 | IIE Transactions, Vol. 47, No. 7
- Simple computing of the customer lifetime value: A fixed local-optimal policy approach
  9 December 2014 | Journal of Systems Science and Systems Engineering, Vol. 23, No. 4
- Robust Markov control processes
  Journal of Mathematical Analysis and Applications, Vol. 420, No. 2
- Dynamic Cooperative Secondary Access in Hierarchical Spectrum Sharing Networks
  IEEE Transactions on Wireless Communications, Vol. 13, No. 11
- OPTIMAL CONTROL OF A TWO-SERVER QUEUEING SYSTEM WITH FAILURES
  27 June 2014 | Probability in the Engineering and Informational Sciences, Vol. 28, No. 4
- Network Coding Decisions for Wireless Transmissions With Delay Consideration
  IEEE Transactions on Communications, Vol. 62, No. 8
- Optimal Control of a Server Farm
  16 June 2016 | INFOR: Information Systems and Operational Research, Vol. 51, No. 4
- Transmit Power Control Policies for Energy Harvesting Sensors With Retransmissions
  IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 5
- Cooperate or not: The secondary user's dilemma in hierarchical spectrum sharing networks
- Managing Dynamic Inventory Systems with Product Returns: A Markov Decision Process
  11 October 2012 | Journal of Optimization Theory and Applications, Vol. 157, No. 2
- Opportunities for network coding: To wait or not to wait
- Computation and Dynamic Programming
  15 February 2011
- Towards a Theory of Anonymous Networking
- Delay Optimal Scheduling in a Two-Hop Vehicular Relay Network
  26 May 2009 | Mobile Networks and Applications, Vol. 15, No. 1
- Monotonicity of Constrained Optimal Transmission Policies in Correlated Fading Channels With ARQ
  IEEE Transactions on Signal Processing, Vol. 58, No. 1
- On the slow server problem
  5 January 2010 | Automation and Remote Control, Vol. 70, No. 12
- Machine maintenance with workload considerations
  19 July 2007 | Naval Research Logistics (NRL), Vol. 54, No. 7
- On Optimality of Monotone Channel-Aware Transmission Policies: A Constrained Markov Decision Process Approach
- Serial Agile Production Systems with Automation
  Wallace J. Hopp,
  Seyed M. R. Iravani,
  Biying Shou,
  1 October 2005 | Operations Research, Vol. 53, No. 5
- Optimal Policies for a Capacitated Two-Echelon Inventory System
  Rodney P. Parker,
  Roman Kapuscinski,
  1 October 2004 | Operations Research, Vol. 52, No. 5
- Order arrival control of an M/M/1 failure prone service station
  International Journal of Production Economics, Vol. 91, No. 2
- THE VALUE OF INFORMATION SHARING IN A TWO-STAGE SUPPLY CHAIN WITH PRODUCTION CAPACITY CONSTRAINTS: THE INFINITE HORIZON CASE
  16 April 2004 | Probability in the Engineering and Informational Sciences, Vol. 18, No. 2
- On optimal inventory control with independent stochastic item returns
  European Journal of Operational Research, Vol. 151, No. 1
- Optimal control of queueing networks: an approach via fluid models
  1 July 2016 | Advances in Applied Probability, Vol. 34, No. 02
- Optimal control of queueing networks: an approach via fluid models
  1 July 2016 | Advances in Applied Probability, Vol. 34, No. 2
- Controlling inventories with stochastic item returns: A basic model
  European Journal of Operational Research, Vol. 138, No. 1
- Denumerable state continuous time Markov decision processes with unbounded cost and transition rates under average criterion
  17 February 2009 | The ANZIAM Journal, Vol. 43, No. 4
- Optimality Conditions for CTMDP with Average Cost Criterion
- Convex Stochastic Fluid Programs with Average Cost
  Journal of Mathematical Analysis and Applications, Vol. 259, No. 1
- COMBINED PRODUCTION AND MAINTENANCE SCHEDULING FOR A MULTIPLE‐PRODUCT, SINGLE‐ MACHINE PRODUCTION SYSTEM
  1 December 2000 | Production and Operations Management, Vol. 9, No. 4
- Optimal Control of Markov Chains
- Combined Pricing and Inventory Control Under Uncertainty
  Awi Federgruen,
  Aliza Heching,
  1 June 1999 | Operations Research, Vol. 47, No. 3
- The value iteration method for countable state Markov decision processes
  Operations Research Letters, Vol. 24, No. 5
- Inventory Control for Joint Manufacturing and Remanufacturing
- Stochastic models for inventory management at service facilities
  Communications in Statistics. Stochastic Models, Vol. 15, No. 4
- Sample-path average optimality for Markov control processes
  IEEE Transactions on Automatic Control, Vol. 44, No. 10
- Application of average dynamic programming to inventory systems
  Mathematical Methods of Operations Research, Vol. 47, No. 3
- Bibliography
  27 May 2008
- Assigning a single server to inhomogeneous queues with switching costs
  Theoretical Computer Science, Vol. 182, No. 1-2
- Optimal service control against worst case admission policies: A multichained stochastic game
  Mathematical Methods of Operations Research, Vol. 45, No. 2
- Policy Iteration for Average Cost Markov Control Processes on Borel Spaces
  Acta Applicandae Mathematica, Vol. 47, No. 2
- The Computation of Average Optimal Policies in Denumerable State Markov Decision Chains
  1 July 2016 | Advances in Applied Probability, Vol. 29, No. 01
- The Computation of Average Optimal Policies in Denumerable State Markov Decision Chains
  1 July 2016 | Advances in Applied Probability, Vol. 29, No. 1
- On computing average cost optimal policies with application to routing to parallel queues
  Mathematical Methods of Operations Research, Vol. 45, No. 1
- Stochastic Inventory Models with Limited Production Capacity and Periodically Varying Parameters
  27 July 2009 | Probability in the Engineering and Informational Sciences, Vol. 11, No. 1
- The policy iteration algorithm for average reward Markov decision processes with general state space
  IEEE Transactions on Automatic Control, Vol. 42, No. 12
- Value Iteration in a Class of Communicating Markov Decision Chains with the Average Cost Criterion
  SIAM Journal on Control and Optimization, Vol. 34, No. 6
- Optimal batch service of a polling system under partial information
  Mathematical Methods of Operations Research, Vol. 44, No. 3
- Denumerable controlled Markov chains with strong average optimality criterion: Bounded & unbounded costs
  Mathematical Methods of Operations Research, Vol. 43, No. 3
- The convergence of value iteration in average cost Markov decision chains
  Operations Research Letters, Vol. 19, No. 1
- Convergence of the policy iteration algorithm with applications to queueing networks and their fluid models
- Average Optimality in Markov Control Processes via Discounted-Cost Problems and Linear Programming
  SIAM Journal on Control and Optimization, Vol. 34, No. 1
- Optimal routing into two heterogeneous service stations with delayed information
  IEEE Transactions on Automatic Control, Vol. 40, No. 7
- A note on the vanishing interest rate approach in average Markov decision chains with continuous and bounded costs
  Systems & Control Letters, Vol. 24, No. 5
- Another set of conditions for average optimality in Markov control processes
  Systems & Control Letters, Vol. 24, No. 2
- Optimal routing of customers with general independent interarrival times in deterministic parallel queues
  IEEE Transactions on Automatic Control, Vol. 40, No. 9
- Non-standard optimality criteria for stochastic control problems
- Zero-sum stochastic games with unbounded costs: Discounted and average cost cases
  ZOR Zeitschrift f� Operations Research Methods and Models of Operations Research, Vol. 39, No. 2
- Nonzero-sum stochastic games with unbounded costs: Discounted and average cost cases
  ZOR Zeitschrift f�r Operations Research Mathematical Methods of Operations Research, Vol. 40, No. 2
- Bibliography
  27 May 2008
- Weak conditions for average optimality in Markov control processes
  Systems & Control Letters, Vol. 22, No. 4
- Jointly optimal admission and routing controls at a network node
  Communications in Statistics. Stochastic Models, Vol. 10, No. 1
- Denumerable controlled Markov chains with strong average optimality criterion: bounded and unbounded costs
- A survey of Markov decision models for control of networks of queues
  Queueing Systems, Vol. 13, No. 1-3
- Optimal flow control allocation policies in communication networks with multiple message classes
  IEEE Transactions on Automatic Control, Vol. 38, No. 3
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
  SIAM Journal on Control and Optimization, Vol. 31, No. 2
- The Average Cost Optimality Equation and Critical Number Policies
  27 July 2009 | Probability in the Engineering and Informational Sciences, Vol. 7, No. 1
- Constrained Average Cost Markov Decision Chains
  27 July 2009 | Probability in the Engineering and Informational Sciences, Vol. 7, No. 1
- Guaranteed performance regions in Markovian systems with competing decision makers
  IEEE Transactions on Automatic Control, Vol. 38, No. 1
- Optimal service control of a station connected with two parallel substations
  IEEE Transactions on Automatic Control, Vol. 38, No. 10
- Optimal control of the M/G/1 queue with repeated vacations of the server
  IEEE Transactions on Automatic Control, Vol. 38, No. 12
- Weak conditions for average optimality in Markov control processes
- Optimal routing into two heterogeneous service stations with delayed information
- Stochastic games and flow control models
- Time-average and asymptotically optimal flow control policies in networks with multiple transmitters
  Annals of Operations Research, Vol. 35, No. 5
- On strong average optimality of markov decision processes with unbounded costs
  Operations Research Letters, Vol. 11, No. 2
- Comparing recent assumptions for the existence of average optimal stationary policies
  Operations Research Letters, Vol. 11, No. 1
- Average Optimality of Markov Decision Processes with Unbounded Costs
- Structured solutions for stochastic control problems
- Guaranteed performance regions for multi-user Markov models
- Optimal Scheduling of a Finite Capacity Shuttle under Delayed Information
  27 July 2009 | Probability in the Engineering and Informational Sciences, Vol. 6, No. 1
- Optimal control of the M/G/1 queue with repeated vacations of the server
- Convex stochastic control problems
- Recent results on conditions for the existence of average optimal stationary policies
  Annals of Operations Research, Vol. 28, No. 1
- Denumerable semi-Markov decision chains with small interest rates
  Annals of Operations Research, Vol. 28, No. 1
- Value iteration in countable state average cost Markov decision processes with unbounded costs
  Annals of Operations Research, Vol. 28, No. 1
- The existence of sensitive optimal policies in two multi-dimensional queueing models
  Annals of Operations Research, Vol. 28, No. 1
- On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes
  Annals of Operations Research, Vol. 29, No. 1
- Constrained Discounted Markov Decision Chains
  27 July 2009 | Probability in the Engineering and Informational Sciences, Vol. 5, No. 4
- Average optimality in dynamic programming on Borel spaces — unbounded costs and controls
  Systems & Control Letters, Vol. 17, No. 3
- Markov Decision Problems and State-Action Frequencies
  SIAM Journal on Control and Optimization, Vol. 29, No. 4
- Optimality of threshold policies in single-server queueing systems with server vacations
  1 July 2016 | Advances in Applied Probability, Vol. 23, No. 02
- Optimality of threshold policies in single-server queueing systems with server vacations
  1 July 2016 | Advances in Applied Probability, Vol. 23, No. 2
- A counterexample on the optimality equation in Markov decision chains with the average cost criterion
  Systems & Control Letters, Vol. 16, No. 5
- Optimal allocation of a server between two queues with due times
  IEEE Transactions on Automatic Control, Vol. 36, No. 12
- Optimal flow control allocation policies in communication networks with priorities
- Time-average and asymptotically optimal flow control policies in networks with multiple transmitters
- Remarks on the existence of solutions to the average cost optimality equation in Markov decision processes
  Systems & Control Letters, Vol. 15, No. 5
- Average cost optimal policies for Markov control processes with Borel state space and unbounded costs
  Systems & Control Letters, Vol. 15, No. 4
- Ergodic control of Markov chains
- On partially observable Markov decision processes with an average cost criterion

Volume 37, Issue 4

July-August 1989

Pages 514-673

Article Information

Metrics

Information

Published Online:August 01, 1989

Cite as

Linn I. Sennott, (1989) Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs. Operations Research 37(4):626-633.

https://doi.org/10.1287/opre.37.4.626

Keywords

dynamic programming infinite state Markov decision processes, average cost: queueing control models

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs

Abstract

Volume 37, Issue 4

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News