An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes
Published Online:1 May 2007https://doi.org/10.1287/ijoc.1050.0155
References
- A genetic search in policy space for solving Markov decision processes. AAAI Spring Sympos. Search Techniques Problem Solving Under Uncertainty and Incomplete Inform. (1999) Stanford University, Stanford, CAGoogle Scholar
- Multidimensional binary search trees in database applications. IEEE Trans. Software Engrg. (1979) 5:333–340Crossref, Google Scholar
- Dynamic Programming and Optimal Control (1995) (Athena Scientific, Belmont, MA) . Vols. 1 and 2Google Scholar
- Adaptive aggregation methods for infinite horizon dynamic programming. IEEE Trans. Automatic Control (1989) 34:589–598Crossref, Google Scholar
- Parallel rollout for online solution of partially observable Markov decision processes. Discrete Event Dynamic Systems: Theory Application (2004) 14:309–341Crossref, Google Scholar
- Evolutionary policy iteration for solving Markov decision processes. IEEE Trans. Automatic Control. (2005) 50:1804–1808Crossref, Google Scholar
- The linear programming approach to approximate dynamic programming. Oper. Res. (2003) 51:850–865Link, Google Scholar
- Applied Numerical Linear Algebra (1997) (Soc. Indust. Appl. Math., Philadelphia, PA) Crossref, Google Scholar
- R-trees: A dynamic index structure for spatial searching. Proc. 1984 Association for Computing Machinery Special Interest Group on Management of Data (1984) (ACM Press, New York) 47–57Crossref, Google Scholar
- , Glover F., Kochenberger G. Iterated local search. Handbook on MetaHeuristics (2002) (Kluwer Academic Publishers, Boston, MA) 321–353Google Scholar
- A modified dynamic programming method for Markovian decision problems. J. Math. Anal. Appl. (1966) 14:38–43Crossref, Google Scholar
- Markov Decision Processes: Discrete Stochastic Dynamic Programming (1994) (Wiley, New York) Crossref, Google Scholar
- Using randomization to break the curse of dimensionality. Econometrica (1997) 65:487–516Crossref, Google Scholar
- Genetic algorithms: A survey. IEEE Comput. (1994) 27:17–26Crossref, Google Scholar
- Spline approximations to value functions: A linear programming approach. Macroeconomic Dynam. (1997) 1:255–277Crossref, Google Scholar
- Feature-based methods for large-scale dynamic programming. Machine Learning (1996) 22:59–94Crossref, Google Scholar
- Genetic algorithms for approximating solutions to POMDPs. (1999) . Department of Computer Science Technical Report TR-290-99, University of Kentucky, Lexington, KY, http://citeseer.ist.psu.edu/277136.htmlGoogle Scholar

