CSE 494/598 Information Retrieval, Mining and Integration on the Internet

Instructor: Subbarao Kambhampati

Next offering: Spring 2010 (T/Th 10:30--11:45AM; BYAC 150)

Note for Undergrads: If you are unable to get in because of capacity restrictions, show up for the first class anyway. There may well be seats by the end of the first week.

This course is geared towards exposing students to some of the core technologies for controlling and using the content on the Internet. The following are some of the questions we will consider:

How do search engines work? Why are some better than others?
Can we think of the web as a big database/knowledge base and support efficient database-style query processing?
Can we find useful pearls and patterns in the mass of accessible data on the Internet?

This course will be a breadth-oriented introduction to the issues involved in answering these questions.

Prerequisites: CSE 310 required. Other courses that will help include CSE 471 (AI), CSE 412 (Databases) and CSE 450 (Algorithms). I am hoping that students have had at least one of these 400-level courses already, but won't insist on them. Students planning to register for this course are encouraged to talk to the instructor (via email at rao wholivesat asu dot edu).

Grading: The grading will be based on class participation, exams and projects.

Textbooks: There is no prescribed textbook. We will read papers (see the reading list).

Overview: The best overview is the list of topics and lecture notes from the previous offering (shown below).

Additional pointers:

Check out what students say about the last offering of this course in CEAS student evaluations.
Check out what students "learned" from the last offering of this course (in their own words).
Check out what earlier students did in the last offering of the course.
Check out what some of the earlier students are doing now.

Lecture Notes & Audio from Fall 2008

Notes from Fall 2008; T/Th 1:30--2:45pm, BY 270

Instructor: Subbarao Kambhampati (Office Hours: T/Th 2:45--3:45PM BY 560)

(TA: Garrett Wolf. Office Hours: Wed: 3:30--4:30pm BY557AD)

Classes coming up

12/9: Final class
12/16: Final exam (12:10--2:00pm)

Todo

Search contextualization..
Measures for comparing ranked lists..

Last modified: Mon Jan 11 15:11:24 MST 2010