A Hybrid Genetic/Optimization Algorithm for Finite-Horizon, Partially Observed Markov Decision Processes

Zong-Zhi Lin
Department of Operations Research, General Motors Corp., Mail Code 483-585-372, 585 South Boulevard, Pontiac, Michigan 48341, USA

James C. Bean
Department of Industrial and Operations Engineering, 1205 Beal Avenue, The University of Michigan, Ann Arbor, Michigan 48109-2117, USA

Chelsea C. White
School of Industrial Systems and Engineering, Georgia Institute of Technology, P.O. Box 173364, Atlanta, Georgia 80217-3364, USA

Published Online: 1 Feb 2004
https://doi.org/10.1287/ijoc.1020.0024

Abstract

The partially observed Markov decision process (POMDP) is a generalization of a Markov decision process that allows for noise-corrupted and costly observations of the underlying system state. The value function of the finite-horizon POMDP is known to be piecewise affine and convex in the probability mass vector over the state space, so it can be represented by a finite set of affine functions.
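As a minimal illustration of that representation (not the paper's GAMIP algorithm), the sketch below evaluates a piecewise affine, convex value function as the maximum of a finite set of affine functions of the belief vector. It assumes a reward-maximizing formulation; the function name `value` and the two-state vectors in `ALPHAS` are hypothetical and chosen only for the example.

```python
from typing import Sequence


def value(belief: Sequence[float], alphas: Sequence[Sequence[float]]) -> float:
    """Evaluate V(b) = max over alpha of (alpha . b).

    `belief` is a probability mass vector over the states; each entry of
    `alphas` holds the coefficients of one affine function of the belief.
    """
    return max(
        sum(a_i * b_i for a_i, b_i in zip(alpha, belief))
        for alpha in alphas
    )


# Hypothetical two-state example: three affine functions (assumed values).
ALPHAS = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.6)]

if __name__ == "__main__":
    for b in [(1.0, 0.0), (0.5, 0.5), (0.2, 0.8)]:
        print(b, value(b, ALPHAS))
```

In this toy set every vector attains the maximum at some belief, so none could be dropped without changing the function; a minimal set in the sense of the abstract contains only such vectors.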

In this paper, we develop and evaluate an exact algorithm, GAMIP, which combines a genetic algorithm and a mixed integer program to construct the minimal set of affine functions that describes the value function. Numerical results indicate that GAMIP takes up to 60% less time to construct the minimal set than does the most efficient linear programming-based exact solution method in the literature.

INFORMS Journal on Computing

Volume 16, Issue 1

Winter 2004

Pages 1-105

Article Information

Received: January 01, 2000
Accepted: November 01, 2002
Published Online: February 01, 2004

Cite as

Zong-Zhi Lin, James C. Bean, Chelsea C. White (2004) A Hybrid Genetic/Optimization Algorithm for Finite-Horizon, Partially Observed Markov Decision Processes. INFORMS Journal on Computing 16(1):27-38.

https://doi.org/10.1287/ijoc.1020.0024
