CSE 574: Planning & Learning.

Fall 2022 (Fridays 1:30--4:15pm; CAVC351)

This course will focus on sequential decision-making problems under a variety of conditions--with known (given) model--normally called planning, and unknown (partially known) model--normally called -- Reinforcement Learning. We will study both deterministic and stochastic settings, atomic as well as lifted (symbolic) models, and function approximation models (e.g. deep reinforcement learning).

The last time I (Rao) taught CSE574 was back in 2008 (!!). The lectures from that offering are below. The Fall 2022 version of the Planning/Learning class will change in significant ways to keep up with the new developments. One important goal is to draw clearer connections between planning and reinforcement learning.

Please note that this course will assume familiarity with Intro AI (see here for Rao's version)

This course will be a cross between a normal lecture course and a seminar course. In addition to instructor lectures, the course will involve paper readings and discussions. It is thus most useful for people with research interests in this area.

(Student comments on Rao's courses can be found here)

Lectures from the 2008 edition of CSE574 are linked below (Note that this edition didn't have any videos. The closest set of videos of my lectures for this material can be found HERE)

1/15: Introduction. (By Menkes/J)(audio is here)
1/17: Progression, regression and Means-Ends analysis planning. (audio is here)
1/22: Plan-space planning (aka partial order planning or least commitment planning); handling conditional effects in progression/regression/plan-space planning; Lifted planning (audio is here)
1/24: Refinement planning as a unifying view (audio is here)
1/29: Reachability heuristics for classical planning (audio is here) (You can also get a "lecture" style discussion of the material from here)
1/31:Bounded-length Planning: Graphplan; SAT/CSP/IP Compilation (audio is here)
2/5: Bounded-length planning: Planning as Model-finding (audio is here)
2/7:Knowledge-based planning (audio is here)
2/12: Knowledge-based planning (contd) (audio is here)
2/14: Learning search control for planning (audio is here)
2/19: Over-Subscription/Partial Satisfaction Planning (audio is here)
2/21: Replanning & Execution MOnitoring (audio is here)
2/26: Temporal Planning: Modeling issues (audio is here)
2/28: Temporal Planning: Search (audio is here)
3/4: Temporal CSPs (audio is here)
3/6: Scheduling (audio is here)
Week of 3/9 is spring break.
3/18: Integrating Planning & Scheduling (audio is here)
3/20: Belief Space Search (audio is here)
3/25: Belief Space Search Continued (Conformant vs. Conditional Planning) (audio is here)
3/27: Belief Space Search: Heuristics (audio is here) (Also check out a BDD mini-tutorial here)
4/1: Decision Theoretic Planning/Markov Decision Processes (audio is here)
- Here is another set of slides on VI and PI
4/3: FOMDPS: Special Cases/Efficient Algorithms (audio is here)
4/8: Reinforcement Learning (lecture by Sungwook Yoon) (audio is here)
4/10: Efficient Search (LAO*; RTDP) and Factored Reps for FOMDPS (audio is here)
4/15: Probabilistic Planning Competition; Sampling methods for solving MDPs. (audio is here)
4/17: Plan Recognition: Overview and Consistency-based methods. (audio is here)
4/22: Plan Recognition: Probabilistic Methods; Decision-Theoretic Assistance (audio is here)
- 4/23: The Myth and Reality of Web Services Composition (audio is here)
4/24:Online Continual Planning to Control Modular Production Printer
4/29: Final Class.
5/6: Final Exam (2:40--4:30pm)

Planned Topics

Introduction, representation & search (1 week)
State Space and Plan Space Planning, Lifting (1 week)
Reachability heuristics (1- week)
SAT/CSP/IP based planning graph search; Planning as model-finding (1- week)
Refinement Planning as a unifying framework (1 week?)
- RECITATION: Case studies of heuristic planners. Graphplan search
Partial satisfaction planning (1 class)
Knowledge-based planning with some emphasis on HTN planning (1 week)
Model-lite planning (1 class?)
Metric/Temporal Planning (1 week)
Scheduling (1 week)
Non-deterministic Planning: Conformant and Conditional planning (1 week)
Probabilistic planning: MDPs and POMDPS (2+ weeks)
Plan & Activity recognition (1 week)
Monitoring and Diagnosis (1/2 class)
Online planning (1/2 class)
Multi-agent planning (1 class)
Planning & Learning (1+ class)

Subbarao Kambhampati

Last modified: Wed Apr 23 14:28:54 MST 2008