On Sequential Decisions and Markov Chains

Cyrus Derman
Columbia University and Technion, Israel Institute of Technology

Published Online: 1 Oct 1962
https://doi.org/10.1287/mnsc.9.1.16

Abstract

Several problems in the optimal control of dynamic systems are considered. When observed, a system is classifiable into one of a finite number of states and controlled by making one of a finite number of decisions. The sequence of observed states is a stochastic process dependent upon the sequence of decisions, in that the decisions determine the probability laws that operate on the system. Costs are associated with the sequence of states and decisions. It is shown that, for the problems considered, the optimal rules for controlling the system belong to a subclass of all possible rules and, within this subclass, the optimal rules can be derived by solving linear programming problems.
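The link between control rules and linear programming can be made concrete with a small sketch. The abstract does not state the cost criterion, the subclass of rules, or the form of the linear programs, so everything below is an assumption drawn from standard Markov decision theory: the criterion is taken to be long-run average cost, the subclass is taken to be stationary deterministic rules (one fixed decision per state), and the linear program is taken to be the usual one over state-decision frequencies x(s, a), with constraints sum_a x(j, a) = sum_{s,a} p(j | s, a) x(s, a) and sum x = 1. The two-state data (P, C) and all function names are hypothetical. The code enumerates the four stationary deterministic rules, picks the cheapest, and checks that its frequencies satisfy those constraints.

```python
from itertools import product

# Hypothetical data, invented for illustration only.
# P[a][s][j]: probability of moving from state s to state j under decision a.
P = {0: [[0.9, 0.1], [0.4, 0.6]], 1: [[0.5, 0.5], [0.8, 0.2]]}
# C[s][a]: expected one-step cost of taking decision a in state s.
C = [[1.0, 3.0], [4.0, 2.0]]
STATES = range(2)
DECISIONS = range(2)


def stationary(rule):
    """Stationary distribution of the two-state chain induced by a rule.

    rule[s] is the decision taken whenever state s is observed.
    """
    p01 = P[rule[0]][0][1]
    p10 = P[rule[1]][1][0]
    total = p01 + p10  # positive for this data, so the chain is irreducible
    return [p10 / total, p01 / total]


def average_cost(rule):
    """Long-run average cost per step under a stationary deterministic rule."""
    pi = stationary(rule)
    return sum(pi[s] * C[s][rule[s]] for s in STATES)


def frequencies(rule):
    """Long-run state-decision frequencies x(s, a) generated by the rule."""
    pi = stationary(rule)
    return {(s, a): (pi[s] if rule[s] == a else 0.0)
            for s in STATES for a in DECISIONS}


def lp_residual(x):
    """Largest violation of the assumed LP constraints (balance, normalization)."""
    worst = abs(sum(x.values()) - 1.0)
    for j in STATES:
        outflow = sum(x[(j, a)] for a in DECISIONS)
        inflow = sum(P[a][s][j] * x[(s, a)] for s in STATES for a in DECISIONS)
        worst = max(worst, abs(outflow - inflow))
    return worst


rules = list(product(DECISIONS, repeat=2))
best = min(rules, key=average_cost)
print(best, round(average_cost(best), 4))  # prints (0, 1) 1.1111
```

Enumeration is only feasible for toy problems, since the number of such rules grows exponentially with the number of states. The linear program over x(s, a) has one variable per state-decision pair, which is why an LP formulation is attractive for larger systems.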

Cited by
- Revenue Management Under a Price Alert Mechanism
  Bo Jiang,
  Zizhuo Wang,
  Nanxi Zhang
  22 April 2026 | Management Science, Vol. 0, No. 0
- Risk-sensitive Markov-perfect equilibrium
  25 June 2025 | Annals of Operations Research, Vol. 43
- Markov Processes
  4 March 2025
- MF-OMO: An Optimization Formulation of Mean-Field Games
  22 January 2024 | SIAM Journal on Control and Optimization, Vol. 62, No. 1
- Risk-Sensitive Markov-Perfect Equilibrium
  1 January 2024 | SSRN Electronic Journal, Vol. 39
- Submixing and shift-invariant stochastic games
  25 May 2023 | International Journal of Game Theory, Vol. 52, No. 4
- Approximate Dynamic Programming: Linear Programming-Based Approaches
  21 September 2022
- Fuzzy-Petri-Networks in Supervisory Control of Markov Processes in Robotized FMS and Robotic Systems
  18 April 2023
- On Zero-Sum Two Person Perfect Information Semi-Markov Games
  1 August 2023
- On the design and analysis of near-term quantum network protocols using Markov decision processes
  30 September 2022 | AVS Quantum Science, Vol. 4, No. 3
- Exact Dynamic Programming
  8 April 2022
- Tail-behavior roadmap for sharp restart
  11 March 2021 | Journal of Physics A: Mathematical and Theoretical, Vol. 54, No. 12
- Mean-performance of sharp restart I: statistical roadmap
  3 September 2020 | Journal of Physics A: Mathematical and Theoretical, Vol. 53, No. 40
- Optimal a priori tour and restocking policy for the single-vehicle routing problem with stochastic demands
  European Journal of Operational Research, Vol. 285, No. 1
- Computing semi-stationary optimal policies for multichain semi-Markov decision processes
  14 October 2017 | Annals of Operations Research, Vol. 287, No. 2
- Dynamic Maintenance, Production and Inspection Policies, for a Single-Stage, Multi-State Production System
  IEEE Access, Vol. 8
- Maintaining systems with heterogeneous spare parts
  18 July 2019 | Naval Research Logistics (NRL), Vol. 66, No. 6
- Optimization of a fully adaptive quality and maintenance model in the presence of multiple location and scale quality shifts
  Applied Mathematical Modelling, Vol. 54
- Semi-Markov decision processes with limiting ratio average rewards
  Journal of Mathematical Analysis and Applications, Vol. 455, No. 1
- Polynomial-Time Computation of Strong and n-Present-Value Optimal Policies in Markov Decision Chains
  Michael O’Sullivan,
  Arthur F. Veinott
  14 December 2016 | Mathematics of Operations Research, Vol. 42, No. 3
- Maintenance planning for a deteriorating production process
  Reliability Engineering & System Safety, Vol. 159
- Optimality Conditions for Inventory Control
  Eugene A. Feinberg
  4 November 2016
- A Duality Framework for Stochastic Optimal Control of Complex Systems
  IEEE Transactions on Automatic Control, Vol. 61, No. 10
- Synthesizing efficient systems in probabilistic environments
  13 May 2015 | Acta Informatica, Vol. 53, No. 4
- Perspectives of approximate dynamic programming
  7 February 2012 | Annals of Operations Research, Vol. 241, No. 1-2
- Impact of supply risks on procurement decisions
  10 July 2013 | Annals of Operations Research, Vol. 241, No. 1-2
- On undiscounted semi-Markov decision processes with absorbing states
  16 December 2015 | Mathematical Methods of Operations Research, Vol. 83, No. 2
- Jointly optimal policies for pavement maintenance, resurfacing and reconstruction
  EURO Journal on Transportation and Logistics, Vol. 4, No. 1
- Repair, Inspection, and Replacement Models
  29 September 2014
- Maximum-Stopping-Value Policies in Finite Markov Population Decision Chains
  B. Curtis Eaves,
  Arthur F. Veinott
  24 January 2014 | Mathematics of Operations Research, Vol. 39, No. 3
- Optimal maintenance scheduling for a complex manufacturing system subject to deterioration
  8 February 2014 | Annals of Operations Research, Vol. 217, No. 1
- A partial history of the early development of continuous-time nonlinear stochastic systems theory
  Automatica, Vol. 50, No. 2
- Markov Processes
  8 November 2013
- Derman’s book as inspiration: some results on LP for MDPs
  4 January 2012 | Annals of Operations Research, Vol. 208, No. 1
- Discounting axioms imply risk neutrality
  8 February 2012 | Annals of Operations Research, Vol. 208, No. 1
- On the life and work of Cyrus Derman
  23 July 2013 | Annals of Operations Research, Vol. 208, No. 1
- Optimal joint maintenance and operation policies to maximise overall systems effectiveness
  International Journal of Production Research, Vol. 51, No. 5
- Strong polynomiality of the Gass-Saaty shadow-vertex pivoting rule for controlled random walks
  1 August 2012 | Annals of Operations Research, Vol. 201, No. 1
- Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
  Eugene A. Feinberg,
  Pavlo O. Kasyanov,
  Nina V. Zadoianchuk
  5 September 2012 | Mathematics of Operations Research, Vol. 37, No. 4
- The Rail Quality Index as an Indicator of the “Global Comfort” in Optimizing Safety, Quality and Efficiency in Railway Rails
  Procedia - Social and Behavioral Sciences, Vol. 53
- An intelligent fuzzy-Petri reasoning supervisory control for FMS manufacturing plants
- FMS intelligent supervision and control 2: Communications among machines under FPSC
- Trend Curve Optimal Control Model for Optimizing Pavement Maintenance Strategies Consisting of Various Treatments
  7 February 2012 | Computer-Aided Civil and Infrastructure Engineering, Vol. 27, No. 3
- Synthesizing Efficient Controllers
- Equilibrium control policies for Markov chains
- Maintenance scheduling of a manufacturing system subject to deterioration
  Reliability Engineering & System Safety, Vol. 96, No. 10
- Bibliography
  26 September 2011
- A Bayesian model for the joint optimization of quality and maintenance decisions
  21 February 2011 | Quality and Reliability Engineering International, Vol. 27, No. 2
- Synthesizing Systems with Optimal Average-Case Behavior for Ratio Objectives
  18 February 2011 | Electronic Proceedings in Theoretical Computer Science, Vol. 50
- Sensitivity Analysis and Dynamic Programming
  15 February 2011
- Optimal threshold probability and expectation in semi-Markov decision processes
  Applied Mathematics and Computation, Vol. 216, No. 10
- A Strongly Polynomial Algorithm for Controlled Queues
  Alexander Zadorojniy,
  Guy Even,
  Adam Shwartz
  7 October 2009 | Mathematics of Operations Research, Vol. 34, No. 4
- Zustandsorientierte Maschinenzuordnungs- und Instandhaltungsplanung
  10 November 2009 | Zeitschrift für Betriebswirtschaft, Vol. 79, No. 11
- An economically designed, integrated quality and maintenance model using an adaptive Shewhart chart
  Reliability Engineering & System Safety, Vol. 94, No. 3
- Combinatorial Design of a Stochastic Markov Decision Process
- Optimization of inspection and maintenance decisions for infrastructure facilities under performance model uncertainty: A quasi-Bayes approach
  Transportation Research Part A: Policy and Practice, Vol. 42, No. 8
- Repair, Inspection, and Replacement Models
  15 September 2008
- Stochastic shortest path problems with associative accumulative criteria
  Applied Mathematics and Computation, Vol. 198, No. 1
- Choosing Among Living-Donor and Cadaveric Livers
  Oguzhan Alagoz,
  Lisa M. Maillart,
  Andrew J. Schaefer,
  Mark S. Roberts
  21 September 2007 | Management Science, Vol. 53, No. 11
- Machine maintenance with workload considerations
  19 July 2007 | Naval Research Logistics (NRL), Vol. 54, No. 7
- SEMI-MARKOV DECISION PROCESSES
  22 October 2007 | Probability in the Engineering and Informational Sciences, Vol. 21, No. 4
- Optimal preventive maintenance for equipment with two quality states and general failure time distributions
  European Journal of Operational Research, Vol. 180, No. 1
- A time series analysis framework for transportation infrastructure management
  Transportation Research Part B: Methodological, Vol. 41, No. 5
- Determining the Acceptance of Cadaveric Livers Using an Implicit Model of the Waiting List
  Oguzhan Alagoz,
  Lisa M. Maillart,
  Andrew J. Schaefer,
  Mark S. Roberts
  1 February 2007 | Operations Research, Vol. 55, No. 1
- Marginally Monotonic Maintenance Policies for a Multi-State Deteriorating Machine With Probabilistic Monitoring, and Silent Failures
  IEEE Transactions on Reliability, Vol. 54, No. 3
- A survey on the bandit problem with switching costs
  De Economist, Vol. 152, No. 4
- The Optimal Timing of Living-Donor Liver Transplantation
  Oguzhan Alagoz,
  Lisa M. Maillart,
  Andrew J. Schaefer,
  Mark S. Roberts
  1 October 2004 | Management Science, Vol. 50, No. 10
- Replacement Decisions with Maintenance Under Uncertainty: An Imbedded Optimal Control Model
  Ali Dogramaci,
  Nelson M. Fraiman
  1 October 2004 | Operations Research, Vol. 52, No. 5
- Optimal threshold probability in undiscounted Markov decision processes with a target set
  Applied Mathematics and Computation, Vol. 149, No. 2
- Introduction
- Finite State and Action MDPs
- An algorithm to convert wafer to calendar-based preventive maintenance schedules for semiconductor manufacturing systems
- On One Property of a Regular Markov Chain
  Ukrainian Mathematical Journal, Vol. 54, No. 4
- Numerical Comparison of Controls and Verification of Optimality for Stochastic Control Problems
  Journal of Optimization Theory and Applications, Vol. 106, No. 1
- Approximation of Infinite-Dimensional Linear Programming Problems which Arise in Stochastic Control
  SIAM Journal on Control and Optimization, Vol. 36, No. 4
- Existence of Markov Controls and Characterization of Optimal Markov Controls
  SIAM Journal on Control and Optimization, Vol. 36, No. 2
- Управляемые случайные последовательности: методы выпуклого анализа и задачи с функциональными ограничениями
  Успехи математических наук, Vol. 53, No. 6
- The tolerance approach in multiobjective linear fractional programming
  Top, Vol. 5, No. 2
- Fractional Programming
- A framework for developing maintenance policy for flexible manufacturing systems
- Markov Branching Decision Chains with Interest-Rate-Dependent Rewards
  27 July 2009 | Probability in the Engineering and Informational Sciences, Vol. 9, No. 1
- Constrained Semi-Markov decision processes with average rewards
  ZOR Zeitschrift für Operations Research Mathematical Methods of Operations Research, Vol. 39, No. 3
- Look-back policies for two-stage, pull-type production/inventory systems
  Annals of Operations Research, Vol. 48, No. 4
- Bibliography
  27 May 2008
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey
  SIAM Journal on Control and Optimization, Vol. 31, No. 2
- 6 Algorithms and complexity for markov processes
- Preventive replacement in systems with dependent components
  IEEE Transactions on Reliability, Vol. 41, No. 2
- An infinite-dimensional LP solution to control of a continuous, monotone process
- An optimal buffer management policy for high-performance packet switching
- Ergodic control of Markov chains
- Equivalence of various linearization algorithms for linear fractional programming
  ZOR Zeitschrift für Operations Research Methods and Models of Operations Research, Vol. 33, No. 1
- Optimal queueing systems controls with finite buffers and with multiple component cost functions
  IEEE Transactions on Systems, Man, and Cybernetics, Vol. 19, No. 2
- Optimal dynamic routing in Flexible Manufacturing Systems with limited buffers
  Annals of Operations Research, Vol. 15, No. 1
- Optimal maintenance policies for constantly monitored systems
  Naval Research Logistics, Vol. 35, No. 4
- Analytical Framework for Optimizing Pavement Maintenance
  Journal of Transportation Engineering, Vol. 114, No. 3
- Fractional Programming
- Broadcast delivery
  Proceedings of the IEEE, Vol. 76, No. 12
- On the Optimality of Cyclic Transmission in Teletext Systems
  IEEE Transactions on Communications, Vol. 35, No. 1
- A Framework for Replacement Modeling Assumptions
  The Engineering Economist, Vol. 32, No. 1
- Some basic concepts of numerical treatment of Markov decision models
  Statistics, Vol. 17, No. 1
- On the optimality of cyclic transmission in teletext systems
- State information lag markov decision process with control limit rule
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 32, No. 3
- Conditions for the existence of decision horizons for discounted problems in a stochastic environment: A note
  Operations Research Letters, Vol. 4, No. 2
- On maximizing the average time at a goal
  Stochastic Processes and their Applications, Vol. 17, No. 2
- Optimal control of entry to an M/Ek/1 queue serving several classes of customers
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 30, No. 2
- Fractional programming
  European Journal of Operational Research, Vol. 12, No. 4
- Adaptive Policies in Markov Decision Processes with Uncertain Transition Matrices
  Journal of Information and Optimization Sciences, Vol. 4, No. 1
- Controlled Markov Processes with Arbitrary Numerical Criteria
  Theory of Probability & Its Applications, Vol. 27, No. 3
- Bibliography in fractional programming
  Zeitschrift für Operations Research, Vol. 26, No. 1
- Reliability analysis: Optimal inspection and maintenance schedules of failing systems
  Microelectronics Reliability, Vol. 22, No. 1
- On Semi-Markov Controlled Models with an Average Reward Criterion
  Theory of Probability & Its Applications, Vol. 26, No. 4
- Monotone optimal preventive maintenance policies for stochastically failing equipment
  5 July 2007 | Naval Research Logistics Quarterly, Vol. 28, No. 3
- Optimal control for entry of many classes of customers to an M/M/1 queue
  5 July 2007 | Naval Research Logistics Quarterly, Vol. 28, No. 3
- Learning Control of Finite Markov Chains
  IFAC Proceedings Volumes, Vol. 14, No. 2
- Reliability—risks, resources, rewards
  Reliability Engineering, Vol. 2, No. 3
- Fractional programming: Applications and algorithms
  European Journal of Operational Research, Vol. 7, No. 2
- Optimal maintenance models for systems subject to failure–A Review
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 28, No. 1
- Optimal control of random walks, birth and death processes, and queues
  1 July 2016 | Advances in Applied Probability, Vol. 13, No. 01
- An $\varepsilon$-Optimal Control of a Finite Markov Chain with an Average Reward Criterion
  Theory of Probability & Its Applications, Vol. 25, No. 1
- Capital accumulation and the optimization of renewable resource models
  Journal of Economic Theory, Vol. 23, No. 2
- Non-terminating stochastic ratio game
  29 March 2011 | RAIRO - Operations Research, Vol. 14, No. 1
- The Existence of a Stationary $\varepsilon$-Optimal Policy for a Finite Markov Chain
  Theory of Probability & Its Applications, Vol. 23, No. 2
- Developing an optimal repair‐replacement strategy for pallets
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 25, No. 1
- An Efficient Computational Alternative to ‘Using Linear Programming to Design Oil Pollution Detection Schedules’
  9 July 2007 | A I I E Transactions, Vol. 10, No. 1
- Fractional and complex programming
- Markov ratio decision processes
  Journal of Optimization Theory and Applications, Vol. 21, No. 1
- On Integer and Mixed Integer Fractional Programming Problems
- A Finite Controlled Markov Chain with Small Termination Probability
  Theory of Probability & Its Applications, Vol. 21, No. 1
- A survey of maintenance models: The control and surveillance of deteriorating systems
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 23, No. 3
- On Controlled Finite State Markov Processes with Compact Control Sets
  Theory of Probability & Its Applications, Vol. 20, No. 4
- An algorithm for solving general fractional interval programming problems
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 23, No. 1
- Optimal and suboptimal procedures in group sequential sampling
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 21, No. 1
- An explicit general solution in linear fractional programming
  21 November 2006 | Naval Research Logistics Quarterly, Vol. 20, No. 3
- Optimal control of batch service queues
  1 July 2016 | Advances in Applied Probability, Vol. 5, No. 02
- On the solvability of Bellman's functional equation for a Markovian decision process
  Journal of Mathematical Analysis and Applications, Vol. 42, No. 2
- The Scheduling of a Multi-Product Facility
- An optimal maintenance policy for deep submergence equipment, part 2
- Economic Screening of a Continuously Manufactured Product
  Technometrics, Vol. 14, No. 3
- On Group Sequential Sampling
  Technometrics, Vol. 14, No. 1
- A new optimality criterion for discrete dynamic programming
  Journal of Mathematical Analysis and Applications, Vol. 37, No. 1
- References
- An optimal maintenance policy for deep submergence equipment
- On the optimal long-run control of Markov renewal processes
  Journal of Mathematical Analysis and Applications, Vol. 36, No. 1
- A Markovian reset problem
  IEEE Transactions on Automatic Control, Vol. 15, No. 1
- Surveillance Problems: Poisson Process under Costly Surveillance
  SIAM Journal on Control, Vol. 7, No. 4
- Continuous Time Markovian Sequential Control Processes
  SIAM Journal on Control, Vol. 7, No. 3
- Linear programming considerations on Markovian Decision Processes with no discounting
  Journal of Mathematical Analysis and Applications, Vol. 26, No. 1
- Optimization of stationary control of a discrete deterministic process
  Cybernetics, Vol. 3, No. 2
- Note on sequential search
  29 October 2013 | Naval Research Logistics Quarterly, Vol. 15, No. 3
- Some remarks on a Markovian decision problem with an absorbing state
  Journal of Mathematical Analysis and Applications, Vol. 23, No. 2
- Linear programming algorithms for semi-Markovian decision processes
  Journal of Mathematical Analysis and Applications, Vol. 22, No. 2
- Multichain Markov Renewal Programs
  SIAM Journal on Applied Mathematics, Vol. 16, No. 3
- A survey of some recent Czechoslovak work in automatic statistical process control
  14 July 2016 | Journal of Applied Probability, Vol. 5, No. 1
- Bibliographie zur statistischen Entscheidungstheorie 1950–1967 / Bibliography of Statistical Decision Theory 1950–1967
- Computation and Structure of Optimal Reset Policies
  Journal of the American Statistical Association, Vol. 62, No. 320
- Surveillance Problems: Two-Dimensional with Continuous Surveillance
  SIAM Journal on Control, Vol. 5, No. 2
- Contraction Mappings in the Theory Underlying Dynamic Programming
  SIAM Review, Vol. 9, No. 2
- On some stochastic tactical antisubmarine games
  11 August 2006 | Naval Research Logistics Quarterly, Vol. 14, No. 3
- Bibliography
- Programming and Control Problems Arising from Optimal Routing in Telephone Networks
  29 July 2013 | Bell System Technical Journal, Vol. 45, No. 9
- Markov Renewal Programming by Linear Fractional Programming
  SIAM Journal on Applied Mathematics, Vol. 14, No. 6
- The optimization of K-Effect Models by linear and dynamic programming
  Journal of Mathematical Analysis and Applications, Vol. 13, No. 3
- RELIABILITY-MAINTAINABILITY COST TRADE-OFF VIA DYNAMIC AND LINEAR PROGRAMMING
  16 August 1966
- Markov Decision Processes with Finite State and Decision Spaces
  Theory of Probability & Its Applications, Vol. 11, No. 2
- Water quality improvement programming problems
  9 July 2010 | Water Resources Research, Vol. 1, No. 4
- Markovian sequential control processes—Denumerable state space
  Journal of Mathematical Analysis and Applications, Vol. 10, No. 2
- A "Time-domain" successive approximation method for some linear optimal stochastic systems
  IEEE Transactions on Automatic Control, Vol. 9, No. 3
- Stable sequential control rules and Markov chains
  Journal of Mathematical Analysis and Applications, Vol. 6, No. 2

Volume 9, Issue 1

October 1962

Pages 1-169

Cite as

Cyrus Derman (1962) On Sequential Decisions and Markov Chains. Management Science 9(1):16-24.

https://doi.org/10.1287/mnsc.9.1.16
