|
When and where |
Wed, 4:00PM-7:00PM, Room Whitaker Lab 203 |
|
Instructor |
|
|
Instructor's office hours |
Mon, 4:00PM-5:00PM, Room 252 Packard Lab |
Required:
Reinforcement Learning: An Introduction Richard S. Sutton and Andrew G. Barto
RL is motivated by fields such as behavioral psychology. This motivation can be illustrated in the following video which shows initial training of a dog to take the action of staying put. The incentive or reward is to gain some food.
RL has been shown to be useful to solve a wide variety of tasks including (click links to see some videos): autonomous vehicles navigation tasks, robotics and programming game AI (under “downloads” check the videos of before and after learning).
Following the presentation of Sutton and Barto’s book, we will formalize the reinforcement learning problem as a Markov Decision process (MDP). We will study techniques for solving this problem, limitations and research issues. Concepts such as Markov states, Markov property, dynamic programming and Monte Carlo methods will be covered. For further details please read Chapter 1 of the book.
** All announcements will be made in this web page:
www.cse.lehigh.edu/~munoz/RLMDP
All announcements, handouts, etc. will be posted in this web site:
www.cse.lehigh.edu/~munoz/RLMDP
There will be homework assignments.
Attendance to class is required.
There will be two exams but no final exam.
The final project will be a programming project of the student's choice provided that it has been approved by the instructor. In addition the student must handle a final report describing properties and empirical evaluation of their implementation (details will be agreed with the instructor before hand).
Last update: Mon Aug 17 16:47:57 EDT
2009