CSE571: MDP homework and project (to be continued)

To: Subbarao Kambhampati <rao@asu.edu>
Subject: CSE571: MDP homework and project (to be continued)
From: "Tuan A. Nguyen" <tanguye1@asu.edu>
Date: Fri, 31 Aug 2012 14:57:58 -0700

Hi all:

Please find below the homework and project for the MDP part of the course. We will also post one more exercise on finite-horizon MDPs, so stay tuned.

---

Part 1:

Do exercises 17.4, 17.8, 17.9, and 17.10 in the AIMA textbook (see the attachment).

Part 2:

Implement the following algorithms: value iteration, policy iteration, and modified policy iteration. Test your code on the world in Figure 17.1 and observe how the algorithms perform with different starting states. Use your code to verify your answer to exercise 17.8. Submit your code together with a report on what you observe in these experiments.

Note that you will reuse your implementation for the Reinforcement Learning project, so make it as flexible as you can. There are no constraints on the programming language.
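
To illustrate what a flexible implementation might look like, here is a minimal sketch of value iteration in Python. Everything in it is an assumption made for illustration and not a required design: the dictionary-based MDP representation, the names `value_iteration` and `greedy_policy`, and the parameters `transitions`, `rewards`, `gamma`, and `eps` are all hypothetical. It uses the update U(s) = R(s) + gamma * max_a sum_s' P(s'|s,a) U(s') and stops on a plain threshold on the largest change per sweep, which is simpler than the error bound given in the textbook.

```python
from typing import Dict, Hashable, List, Tuple

State = Hashable
Action = Hashable
# Hypothetical representation: transitions[s][a] is a list of
# (probability, next_state) pairs; a terminal state has no actions.
Transitions = Dict[State, Dict[Action, List[Tuple[float, State]]]]


def expected_utility(outcomes: List[Tuple[float, State]],
                     U: Dict[State, float]) -> float:
    """Expected utility of the successor states of one action."""
    return sum(p * U[s2] for p, s2 in outcomes)


def value_iteration(transitions: Transitions,
                    rewards: Dict[State, float],
                    gamma: float = 0.9,
                    eps: float = 1e-6) -> Dict[State, float]:
    """Repeat Bellman updates until no utility changes by eps or more."""
    U = {s: 0.0 for s in transitions}
    while True:
        new_U = {}
        delta = 0.0
        for s, actions in transitions.items():
            if not actions:
                # Terminal state: its utility is just its reward.
                new_U[s] = rewards[s]
            else:
                best = max(expected_utility(o, U) for o in actions.values())
                new_U[s] = rewards[s] + gamma * best
            delta = max(delta, abs(new_U[s] - U[s]))
        U = new_U
        if delta < eps:
            return U


def greedy_policy(transitions: Transitions,
                  U: Dict[State, float]) -> Dict[State, Action]:
    """Pick, in each non-terminal state, the action with the best expected utility."""
    return {
        s: max(actions, key=lambda a: expected_utility(actions[a], U))
        for s, actions in transitions.items()
        if actions
    }
```

Keeping the transition model and rewards as plain data passed into the functions means the same code can be pointed at a different world without changes; policy iteration and modified policy iteration could share `expected_utility` and the same inputs.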

Due date: 09/19 (this may change depending on how quickly the lectures progress in class)

---

Please let me know if you have any questions.

Tuan

Attachment: MDP Homeworks.pdf
Description: Adobe PDF document
