![]() |
Reinforcement Learning and
Artificial
Intelligence (RLAI) |
Tea-time Talks
|
| Date |
Presenter |
Topic |
Link |
| May 15-18 |
|||
| Mon |
Rich Sutton |
RL in finance paper by Kearns et
alia |
NFK06 |
| Tue |
Michael Bowling |
Reduced Rank Regression |
|
| Wed |
Dan Lizotte |
Intrinsic Reward Exploration
(Simsek & Barto) |
|
| Thu |
Dale Schuurmans |
Boosting |
|
| May 22-25 |
[Organiser: Nolan] |
||
| Mon |
[Victoria Day] |
||
| Tue |
Alborz |
RL in Matrix Form | TD0GD |
| Wed |
Mark Ring |
Konidaris, Barto: Knowledge
Transfer in RL (ICML 2006) |
autoshape |
| Thu |
James Neufeld |
Image Features Using SIFT | SIFT |
| May 29-June 1 |
[Organiser: Brad] |
||
| Mon |
Amir massoud Farahmand |
Policy Gradient in Continuous Time (Remi Munos) | JMLR06 |
| Tue |
Russ Greiner |
KDD Cup 2006 | KDDCup |
| Wed |
Anna Koop |
Information Theoretic Manifold
Discovery |
PDF |
| Thu |
[Cancelled] |
||
| June 12-15 |
[Organiser: Alborz] |
[Note: 1 week break for NIPS] |
|
| Mon |
nolan |
Constructing Informative Priors
using Transfer Learning |
.pdf |
| Tue |
[Cancelled for Ken Thompson] |
||
| Wed |
masoud |
RL with Context Detection (ICML
2006) |
PDF
|
| Thu |
bjjoyce |
Using Inaccurate Models in RL
(ICML 2006) |
PDF |
| June 19-22 |
[Organiser: James] |
||
| Mon |
Tao Wang |
An Analytic Solution to Discrete
Bayesian RL |
PDF |
| Tue |
maz |
||
| Wed |
johanson |
||
| Thu |
coulthar |
Graph Visualization using Disc
Trees |
|
| July 3-6 |
[Organiser: Anna] |
[1 week break for IJCAI deadline and ICML conference] | |
| Mon |
[Canada Day] |
||
| Tue |
Scott Sanner |
Relational RL and feature discovery | SS06 |
| Wed |
Vadim Bulitko |
AIIDE report |
|
| Thu |
carmelo |
Hierarchical Temporal Memory (Hawkins, George, Numenta Inc.) | |
| July 10-13 |
[Organiser: Brian] |
||
| Mon |
armita |
||
| Tue |
yaki |
Variance of discounted Markov
decision processes (Matthew Sobel) |
JSTOR82 |
| Wed |
erafols |
Representing Systems with Hidden
State (AAAI 2006) (C. Hundt, P. Panagaden, J. Pineau, D. Precup) |
PDF |
| Thu |
cosmin |
Planning with Approximate,
Learned Models |
|
| July 17-20 |
[Organiser: Masoud] |
||
| Mon |
awhite [CSC 3-49] |
Cognitive Maps in Rats and
Robots (Vernena Hafner) |
|
| Tue |
silver [CSC 3-49] |
Monte-Carlo Tree Search (Remi
Coulom) |
RC06 |
| Wed |
mlee |
Experience-Efficient Learning in
Associative Bandit Problems |
PDF |
| Thu |
marthal |
Predictive State Representations
with Options (Wolfe and Satinder) |
| Date |
Presenter |
Topic |
Link |
| July 24 (Monday) |
Second round of TTT: organizational meeting | ||
| July 31-August 3 |
[Organiser: Sergey] |
||
| Mon |
Alborz Geramifard |
AAAI 2006 and AIIDE 2006 in a
glance |
|
| Tue |
Sergey Kirshner |
Trees: what are they good
for? |
ChowLiu68 |
| Wed |
Brian Tanner | The Wumpus World as a challenge
problem for representations and reinforcement learning |
|
| Thu |
Anna Koop |
CogSci 2006 summary |
CogSci2006 |
| August 7-10 |
[Organiser: Dan] |
||
| Mon |
[Heritage Day] |
||
| Tue |
Martin Zinkevich |
Teaching and Learning |
|
| Wed |
David Silver |
Planning, predictive representations and two-player games | |
| Thu |
James Kehoe |
Real-Time Processes are Paramount in Classical and Operant Conditioning | |
| August 14-17 |
[Organiser: Tao] |
||
| Mon |
Elliot Ludvig |
Why Rats are Smarter than Pigeons: Lessons for AI from Animal Timing | |
| Tue |
Mohammad Ghavamzadeh |
On the Nystrom Method for
Approximating a Gram Matrix for Improved Kernel-Based Learning |
PDF |
| Wed |
Rich Sutton | the TD model of classical
conditioning |
pdf |
| Thu |
Michael Bowling |
If it worked for NLP... | |
| August 21-24 |
[Organiser: Michael] |
||
| Mon |
Brian Tanner |
Predictive Action Descriptions
from Experience (please come I need help) |
|
| Tue |
Amir massoud Farahmand |
Samuel Meets Amarel: Automating Value Function Approximation using Global State Space Analysis | AAAI05 |
| Wed |
Martin Zinkevich |
Maximum Margin Planning |
PDF |
| Thu |
Mark Lee | Does the Turing Test Demonstrate Intelligence or Not? | |
| August 28-31 |
[Organiser: Cosmin] |
||
| Mon |
Rob Holte |
Coarse-to-Fine Dynamic
Programming |
|
| Tue |
Carmelo Piccione |
How fast to work:Response vigor,
motivation, and tonic dopamine (Niv et al, NIPS 2005) |
PDF |
| Wed |
James Neufeld [CSC 3-49] |
A brief overview of predictive linear Gaussian (PLG) models | PDF |
| Thu |
[Cancelled] |
||
| Fall
Schedule |
Talks every Wednesday, 4:30-5pm
in CSC 3-33 |
||
| September 6 |
Cameron Upright | A 3-dimensional simulation of
the Octopus arm |
|
| September 13 |
[Cancelled] |
||
| September 20 |
James Kehoe |
Is the generation of a learned
response linear or catastrophic? |
|
| September 27 |
Katherine Davison [CSC 3-49] |
Kernel Predictive Linear
Gaussian Models |
|
| October 4 |
Vadim Bulitko [CSC 3-49] |
Learning target model in moving target pursuit | Slides |
| October 11 |
[Cancelled] |
||
| October 18 |
Mark Ring / Tao Wang | The Simulated Adaptive Behavior
(SAB) Conference, held in Rome, September 2006 |
SAB'06 |
| October 25 |
[Cancelled] |
||
| November 1 |
Csaba Szepesvari | Exploration-Exploitation Algorithms for finite MDPs | BK97 |
| November 8 |
Nolan Bard |
An Introduction to Bayesian
Filtering Techniques |
|
| November 15 |
Mohammad Ghavamzadeh |
Linearly-Solvable Markov Decision Problems | PDF |
| November 22 |
|
[Cancelled] |
|
| November 29 |
Eric Coulthard |
Netflix Million Dollar Prize | Link |
| December 6 |
[NIPS] |
||
| December 13 |
|||
| Future |
|||
| johanson |
|||
| Andrew Butcher | Temporal Difference Models and Reward-Related Learning in the Human Brain | Paper | |
| Armita Kaboli |
An Efficient Placement and Routing Technique for Fault-tolerant Distributed Embedded Computing | ||
| greiner |
|||
| dale |
|||
| cosmin |
|||
| masoud |
|||
| bjjoyce | |||
| awhite | |||
| erafols |


http://icml2006.org/icml2006/technical/accepted.html