A good source of textbookish readings for IR material + Links for/from today's class (including why "miserable failure" didn't work)
- To: cse494-s07@parichaalak.eas.asu.edu
- Subject: A good source of textbookish readings for IR material + Links for/from today's class (including why "miserable failure" didn't work)
- From: "Subbarao Kambhampati" <rao@asu.edu>
- Date: Thu, 8 Feb 2007 18:25:05 -0700
- Sender: subbarao2z2@gmail.com
First off, here is an NYT article explaining that, as of the end of January 2007, Google decided to
quit acting all innocent and hand-removed the "miserable failure" Google bomb link to
the White House:
http://www.nytimes.com/2007/01/29/technology/29google.html
On a more serious front, I added a new uber-link in the readings list to a draft textbook
on information retrieval that is freely available on the web:
http://www-csli.stanford.edu/%7Eschuetze/information-retrieval-book.html
You might consider consulting it as
a good additional source of readings on topics discussed in class
(note: some chapters are more drafty than others).
You can find a discussion of how to statistically estimate the relative index sizes of search engines in chapter
19 (section 19.5). The specific link is:
http://nlp.stanford.edu/IR-book/pdf/chapter-webchar.pdf
That whole chapter is a quick overview of web search issues/challenges.
The discussion of distributed indexing and crawling can be found in the next chapter, chapter 20.
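
For those who want to play with the index size estimation idea mentioned above, here is a rough sketch of the capture-recapture style argument: if x is the fraction of pages sampled from engine E1 that are also in E2, and y is the fraction of pages sampled from E2 that are also in E1, then x|E1| and y|E2| both estimate the overlap, so |E1|/|E2| is roughly y/x. The function name, the toy "indexes" (plain Python sets), and the sample sizes are all made up for illustration. The sketch also assumes we can draw uniform random pages from each index, which is exactly the hard part with real search engines.

```python
import random


def estimate_size_ratio(sample_e1, sample_e2, in_e1, in_e2):
    """Estimate |E1| / |E2| from (assumed uniform) samples of each index.

    sample_e1, sample_e2: pages sampled from E1 and E2 respectively.
    in_e1, in_e2: membership tests, e.g. "does this engine return the page?"
    """
    # x: fraction of sampled E1 pages that E2 also indexes.
    x = sum(1 for page in sample_e1 if in_e2(page)) / len(sample_e1)
    # y: fraction of sampled E2 pages that E1 also indexes.
    y = sum(1 for page in sample_e2 if in_e1(page)) / len(sample_e2)
    if x == 0:
        raise ValueError("no sampled E1 page was found in E2; ratio undefined")
    # x * |E1| ~ overlap ~ y * |E2|, hence |E1| / |E2| ~ y / x.
    return y / x


# Toy data (hypothetical): page ids are integers, indexes are sets.
e1 = set(range(0, 6000))      # |E1| = 6000
e2 = set(range(4000, 7000))   # |E2| = 3000, overlap with E1 = 2000

rng = random.Random(0)        # fixed seed so the run is repeatable
sample_e1 = rng.sample(sorted(e1), 2000)
sample_e2 = rng.sample(sorted(e2), 2000)

ratio = estimate_size_ratio(
    sample_e1, sample_e2,
    in_e1=lambda page: page in e1,
    in_e2=lambda page: page in e2,
)
print(f"estimated |E1|/|E2| = {ratio:.2f} (true value is 2.0)")
```

Note that only the relative size comes out of this; neither absolute index size is recovered. The estimate is also only as good as the sampling, so biased samples give biased ratios.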
cheers
Rao