
Mountain Car References
[Moore, 1990] A. Moore, Efficient
Memory-Based
Learning for Robot Control, PhD thesis, University of Cambridge,
November 1990. http://citeseer.ist.psu.edu/moore90efficient.html
[Singh and Sutton, 1996] Singh, S.P. and Sutton, R.S. (1996)
Reinforcement learning with replacing eligibility traces. Machine
Learning 22(1/2/3):123-158.
http://citeseer.ist.psu.edu/singh96reinforcement.html
[Sutton and Barto, 1998] Reinforcement Learning:. An Introduction. Richard S.
Sutton and Andrew G. Barto. A Bradford Book. The MIT Press Cambridge,
Massachusetts London, England, 1998
[Smart and Kaelbling, 2000] Smart, W. D. and Kaelbling,
L. P. (2000). Practical Reinforcement Learning in Continuous Spaces. In
Proc. 17th International Conf. on Machine Learning, pages 903–910.
Morgan Kaufmann, San Francisco, CA.
[Boyan and Moore, 1995] Boyan, J. A. and Moore, A. W.
(1995). Generalization in Reinforcement Learning: Safely Approximating
the Value Function. In Tesauro, G., Touretzky, D. S., and Leen, T. K.,
editors, Advances in Neural Information Processing Systems 7, pages
369–376, Cambridge, MA. The MIT Press.
[Wiewiora et al., 2003] Wiewiora, E., Cottrell, G. W.,
and Elkan, C. (2003). Principled Methods for Advising Reinforcement
Learning Agents. In International Conference on Machine Learning, pages
792–799.
[Riedmiller, 2005] Riedmiller, M. (2005). Neural Fitted
Q Iteration - First Experiences with a Data Efficient Neural
Reinforcement Learning Method. In European Conference on Machine
Learning, pages 317–328.
[Bagnell, 2004] Bagnell, J. (2004). Learning Decisions:
Robustness, Uncertainty, and Approximation. PhD thesis, Robotics
Institute, Carnegie Mellon University, Pittsburgh, PA.
[Sutton, 1996] Sutton, R. S. (1996). Generalization in
Reinforcement Learning: Successful Examples Using Sparse Coarse Coding.
In Touretzky, D. S., Mozer, M. C., and Hasselmo, M. E., editors,
Advances in Neural Information Processing Systems, volume 8, pages
1038–1044. The MIT Press.