 |
Reinforcement Learning and
Artificial
Intelligence (RLAI)
|
Function
Approximation Assignments
|
These may be assigned this year.
For this assigment you can think of the domain to be a 100 x 100 real
value coordinate system. You may realize that the dimensions
don't matter - but having a fixed size may make it easier to imagine
for now.
Take tile coder and run on the following data:
Training Data Points:
X Coordinate
|
Y
Coordinate
|
Value
|
30.0
|
10.0
|
5.0
|
70.0
|
20.0
|
3.0
|
8.0
|
50.0
|
-1.0
|
Test (Query) Data Points:
X
Coordinate
|
Y
Coordinate
|
Value
|
55.8
|
45.2
|
?
|
37.5
|
99.0
|
?
|
30.5
|
9.5
|
?
|
30.5
|
15
|
?
|
Use 16 tilings over the XY space and examine what happens when you
change the width of the tiles and alpha. To start, use a width of 10 (one tenth of the 100 x 100 space) and alpha = .1
You can download the tile coding software from:
http://rlai.cs.ualberta.ca/RLAI/RLtoolkit/RLtoolkit1.0.html#downloads
We will also be posting a more minimal download that is just the tile
coding software in the next couple of days.
Tile
coding software only
Tile
coding software documentation
Handin:
RELEVANT portions of your code. Do not hand in the RL code that
you downloaded, PLEASE.
Your results for various widths and alphas. Talk about your
results a bit. It would be nice (but not necessary) for you to
include some sort of 3d view (with Excel or similar) of what the
function looks like after training. Basically, do the assignment,
see what you find interesting, and briefly present it.