LRTDP paper
- To: Rao Kambhampati <rao@asu.edu>
- Subject: LRTDP paper
- From: Subbarao Kambhampati <rao@asu.edu>
- Date: Wed, 6 Oct 2010 16:03:13 -0700
- Sender: subbarao2z2@gmail.com
The LRTDP paper is here: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.75.4460 (only the "cached copy" link works)
rao
PS: Ask yourself this: why does LRTDP overwrite the current state's value with the Q-value of its greedy action? Shouldn't it instead take the min of the current value and the Q-value of the greedy action?
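
One way to explore the question is to try both update rules on a toy problem. The sketch below is a minimal, hypothetical example in Python: the one-state, two-action MDP, its costs, the zero initial values, and every function name are assumptions made up for illustration and are not taken from the paper. It assumes a cost-minimizing setting, where the greedy action is the one with the lowest Q-value.

```python
# Hypothetical toy MDP (made up for illustration, not from the paper).
# TRANSITIONS[state][action] = (cost, [(probability, next_state), ...])
TRANSITIONS = {
    "s0": {
        "a": (1.0, [(1.0, "goal")]),
        "b": (1.0, [(0.5, "s0"), (0.5, "goal")]),
    },
}


def q_value(V, s, a):
    """Expected cost of taking action a in s, then following the values in V."""
    cost, outcomes = TRANSITIONS[s][a]
    return cost + sum(p * V[s2] for p, s2 in outcomes)


def greedy_q(V, s):
    """Q-value of the greedy (lowest expected cost) action in s."""
    return min(q_value(V, s, a) for a in TRANSITIONS[s])


def update_overwrite(V, s):
    """Overwrite V(s) with the greedy action's Q-value."""
    V[s] = greedy_q(V, s)


def update_min(V, s):
    """Alternative from the question: keep the smaller of V(s) and the greedy Q-value."""
    V[s] = min(V[s], greedy_q(V, s))


def run(update, sweeps=3):
    """Apply the given update rule to every non-goal state a few times."""
    V = {"s0": 0.0, "goal": 0.0}  # assumed initial values: zero everywhere
    for _ in range(sweeps):
        for s in TRANSITIONS:
            update(V, s)
    return V


if __name__ == "__main__":
    print("overwrite:", run(update_overwrite)["s0"])
    print("min:      ", run(update_min)["s0"])
```

In this toy setup, with values initialized to zero, the overwrite rule moves the value of s0 to 1.0, while the min rule leaves it at 0.0. Whether the same thing happens under LRTDP's own initialization is worth checking against the paper.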