RLAI Reinforcement Learning and Artificial Intelligence (RLAI
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: safely approximating the value function. In Advances in Neural Information Processing Systems, San Mateo, CA, 1995.

 
Author: Anna October, 2004
Download Here
First Author HomePage


Abstract:

 
    A straightforward approach to the curse of dimensionality in reinforcement learning and dynamic programming is to replace the lookup table with a generalizing function approximator such as a neural net. Although this has been successful in the domain of backgammon, there is no guarantee of convergence. In this paper, we show that the combination of dynamic programming and function approximation is not robust, and in even very benign cases, may produce an entirely wrong policy. We then introduce Grow-Support, a new algorithm which is safe from divergence yet can still reap the benefits of successful generalization.


Keywords:
Reinforcement Learning, Function Approximation, Dynammic Programming
 

Bibtex:

@inproceedings{ boyan95generalization,
author = "Justin A. Boyan and Andrew W. Moore",
title = "Generalization in Reinforcement Learning: {S}afely Approximating the Value Function",
booktitle = "Advances in Neural Information Processing Systems 7",
publisher = "The MIT Press",
address = "Cambridge, MA",
editor = "G. Tesauro and D. S. Touretzky and T. K. Leen",
pages = "369--376",
year = "1995",
url = "citeseer.ist.psu.edu/boyan95generalization.html"}


Comments: