CSE 494/598 Information Retrieval, Mining and Integration on the Internet

Instructor: Subbarao Kambhampati

Next offering: Fall 2011 (T/Th 4:30--5:45pm, BYENG 270 (**NEW BIGGER ROOM**))

Notice: If you are trying to get into either 494 or 598 sessions of this class for Fall 2011, please show up on the first day. I will not be able to give any individual overrides at this time. My experience however is that there is a lot of churn over summer and everyone who wants a seat typically gets one when the dust settles down..

This course is geared towards exposing students to some of the core technologies for controlling and using the content on the Internet. The following are some of the questions we will consider:

  1. How do search engines work? Why are some pp better than others?
  2. Can we think of the web as a big database/knoweldge base and support efficient database style query processing?
  3. Can we find useful pearls and patterns in the mass of accessible data on the Internet?

This course will be breadth-oriented introduction to the issues involved in answering these questions.

Prerequisites: CSE 310 required. Other courses that will help include CSE 471 (AI) CSE 412 (Databases) and CSE 450 (Algorithms). I am hoping that students have had at least one of these 4-level courses already, but won't insist on them. Students planning to register for this course are encouraged to talk to the instructor (via email at rao wholivesat asu dot edu).

Grading: The grading will be based on class participation, exams and projects.

Textbooks: There is no prescribed textbook. We will read papers (see the reading list.)

Overview: The best overview is the list of topics and lecture notes from the previous offering (shown below).

Additional pointers: