CSE 574: Planning & Learning.
Fall 2022 (Fridays 1:30--4:15pm; CAVC351)
This course will focus on sequential decision-making problems under a
variety of conditions--with known (given) model--normally called
planning, and unknown (partially known) model--normally called --
Reinforcement Learning. We will study both deterministic and
stochastic settings, atomic as well as lifted (symbolic) models, and
function approximation models (e.g. deep reinforcement learning).
The last time I (Rao) taught CSE574 was back in 2008 (!!). The
lectures from that offering are below. The Fall 2022 version of the
Planning/Learning class will change in significant ways to keep up
with the new developments. One
important goal is to
draw clearer connections between planning and
reinforcement learning.
Please note that this course will assume familiarity with
Intro AI
(see here for Rao's version)
This course will be a cross between a normal lecture course and a
seminar course. In addition to instructor lectures, the course will
involve paper readings and discussions. It is thus most useful for
people with research interests in this area.
(Student comments on Rao's courses can be found here)
Lectures from the 2008 edition of CSE574 are linked below (Note that
this edition didn't have any videos. The closest set of
videos of my lectures for this material can be found HERE)
- 1/15: Introduction. (By Menkes/J)(audio is
here)
- 1/17: Progression, regression and Means-Ends analysis planning. (audio is
here)
- 1/22: Plan-space planning (aka partial order planning or least commitment planning); handling conditional effects in progression/regression/plan-space planning; Lifted planning (audio is
here)
- 1/24: Refinement planning as a unifying view (audio is
here)
- 1/29: Reachability heuristics for classical planning (audio is
here)
(You can also get a "lecture" style discussion of the material from
here)
- 1/31:Bounded-length Planning: Graphplan; SAT/CSP/IP Compilation (audio is
here)
- 2/5:
Bounded-length planning: Planning as Model-finding (audio is
here)
- 2/7:Knowledge-based planning (audio is
here)
- 2/12:
Knowledge-based planning (contd) (audio is
here)
- 2/14: Learning search control for planning (audio is
here)
- 2/19: Over-Subscription/Partial Satisfaction Planning (audio is
here)
- 2/21: Replanning & Execution MOnitoring (audio is
here)
- 2/26: Temporal Planning: Modeling issues (audio is
here)
- 2/28: Temporal Planning: Search (audio is
here)
- 3/4: Temporal CSPs (audio is
here)
- 3/6: Scheduling (audio is
here)
Week of 3/9 is spring break.
- 3/18: Integrating Planning & Scheduling (audio is
here)
- 3/20: Belief Space Search (audio is
here)
- 3/25: Belief Space Search Continued (Conformant vs. Conditional Planning) (audio is
here)
- 3/27: Belief Space Search: Heuristics (audio is
here)
(Also check out a BDD mini-tutorial here)
- 4/1: Decision Theoretic Planning/Markov Decision Processes (audio is
here)
- 4/3: FOMDPS: Special Cases/Efficient Algorithms (audio is
here)
- 4/8: Reinforcement Learning (lecture by Sungwook Yoon) (audio is
here)
- 4/10: Efficient Search (LAO*; RTDP) and Factored Reps for FOMDPS (audio is
here)
- 4/15: Probabilistic Planning Competition; Sampling methods for solving MDPs. (audio is
here)
- 4/17: Plan Recognition: Overview and Consistency-based methods. (audio is
here)
- 4/22: Plan Recognition: Probabilistic Methods; Decision-Theoretic Assistance (audio is
here)
- 4/24:Online Continual Planning to Control Modular Production Printer
- 4/29: Final Class.
- 5/6: Final Exam (2:40--4:30pm)
Planned Topics
-
Introduction, representation & search (1 week)
-
State Space and Plan Space Planning, Lifting (1 week)
-
Reachability heuristics (1- week)
-
SAT/CSP/IP based planning graph search; Planning as model-finding (1- week)
-
Refinement Planning as a unifying framework (1 week?)
-
RECITATION: Case studies of heuristic
planners. Graphplan search
-
Partial satisfaction planning (1 class)
-
Knowledge-based planning with some emphasis on HTN planning (1 week)
-
Model-lite planning (1 class?)
-
Metric/Temporal Planning (1 week)
-
Scheduling (1 week)
-
Non-deterministic Planning: Conformant and Conditional planning (1 week)
-
Probabilistic planning: MDPs and POMDPS (2+ weeks)
-
Plan & Activity recognition (1 week)
-
Monitoring and Diagnosis (1/2 class)
-
Online planning (1/2 class)
-
Multi-agent planning (1 class)
-
Planning & Learning (1+ class)
Subbarao Kambhampati
Last modified: Wed Apr 23 14:28:54 MST 2008