Readings for Tomorrow--LAO* and LRTDP
- To: "Rao Kambhampati" <rao@asu.edu>
- Subject: Readings for Tomorrow--LAO* and LRTDP
- From: "Subbarao Kambhampati" <rao@asu.edu>
- Date: Wed, 2 Apr 2008 06:56:14 -0700
- Sender: subbarao2z2@gmail.com
Part of what we will do tomorrow is to consider optimal policy
construction algorithms for
FOMDPs that make policy construction look more like the heuristic search
we have seen for the rest of the semester.
In particular, we saw that classical planning can be seen as A*
search, and belief-space planning can be seen as
AO* search. Typical AO* search algorithms work on acyclic graphs (note
that AO* can be seen as a
problem decomposition framework, and cycles imply that you are
reducing a problem indirectly to itself).
The LAO* paper below shows that FOMDP policy construction can be seen
as AO* search on cyclic
graphs.
LAO*
http://www.cse.msstate.edu/~hansen/papers/laostar.pdf
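
To see why cycles matter, here is a minimal sketch in Python. The one-state
problem and the function name are made up for illustration and are not taken
from the paper. The value of s is defined in terms of itself, so the single
backward pass that acyclic AO* uses for cost revision has nowhere to start,
and some form of iteration is needed instead. As I understand it, LAO*
addresses this by running dynamic programming (value or policy iteration)
over the states of the current best partial solution.

```python
# Hypothetical one-state example: from s, the only action costs 1 and
# reaches the goal with probability 0.5; otherwise it returns to s.
# The Bellman equation V(s) = 1 + 0.5 * V(s) has V(s) on both sides,
# so one backward pass (as in acyclic AO*) cannot evaluate it.

def solve_by_iteration(p_goal=0.5, cost=1.0, tol=1e-9):
    v = 0.0  # admissible (optimistic) initial estimate
    while True:
        # Bellman backup; the goal itself contributes value 0.
        new_v = cost + (1.0 - p_goal) * v
        if abs(new_v - v) < tol:
            return new_v
        v = new_v


value = solve_by_iteration()
```

With these numbers the iteration approaches the closed-form answer
cost / p_goal = 2.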
Another way to view value function computation is as
fixed-depth expansion under a node
(as in game trees--in fact, in 471, I motivated game trees in terms of
RTDP). The LRTDP (Labeled RTDP) algorithm
improves on RTDP by labeling states as solved once their values have
converged, so that later trials can skip them.
LRTDP
http://ftp.cs.ucla.edu/pub/stat_ser/R319.pdf
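
For orientation before reading, here is a rough sketch of plain RTDP (not
LRTDP) in Python. The toy MDP, the unit action costs, the zero heuristic and
all names are assumptions made for illustration. Each trial starts at the
initial state, picks the greedy action under the current value estimates,
does a Bellman backup at the visited state only, and samples a successor.
The labeling step that LRTDP adds is not shown.

```python
import random

# Hypothetical toy MDP (not from the papers): every action costs 1 and
# TRANSITIONS[state][action] lists (probability, next_state) outcomes.
TRANSITIONS = {
    "s0": {"safe": [(1.0, "s1")], "risky": [(0.5, "goal"), (0.5, "s0")]},
    "s1": {"go": [(0.9, "goal"), (0.1, "s0")]},
}
GOAL = "goal"


def q_value(V, state, action):
    """Expected cost of taking `action` in `state`, then following V."""
    return 1.0 + sum(p * V.get(nxt, 0.0) for p, nxt in TRANSITIONS[state][action])


def rtdp(start, trials=200, max_steps=1000, seed=0):
    rng = random.Random(seed)
    V = {}  # missing entries default to the admissible heuristic h = 0
    for _ in range(trials):
        state = start
        for _ in range(max_steps):
            if state == GOAL:
                break
            # Greedy action under the current value estimates.
            action = min(TRANSITIONS[state], key=lambda a: q_value(V, state, a))
            # Bellman backup at the visited state only.
            V[state] = q_value(V, state, action)
            # Sample the next state from the chosen action's outcomes.
            outcomes = TRANSITIONS[state][action]
            state = rng.choices([n for _, n in outcomes],
                                weights=[p for p, _ in outcomes])[0]
    return V


V = rtdp("s0")
```

In this toy problem the optimal cost-to-go is 2 at s0 and 1.2 at s1. The
estimate at s0 converges, while s1 drops off the greedy path early and keeps
an optimistic underestimate, which is the usual RTDP behavior of spending
effort only on states the greedy policy actually reaches.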
rao