INFO I427 Search Informatics (3 CR)
Google under the hood

Tentative Spring 2009 schedule -- will be updated as we go. Readings (from Belew's FOA) due ahead of each class. Assignments due by class time unless otherwise specified in class or on Oncourse.
Week Dates Topic Assignments Readings and notes
1 1/12-14 Intro to course, search engines, and Google: the biz buzz Ch 1
2 1/21 Unix review Review Perl: O'Reilly books, Books24x7 (free!)

STEPS Workshops:

All workshops at IUB Library Information Commons (IC103). Free workshops materials available from IT training

3 1/26-28 Perl review
4 2/2-4 More Perl review; LWP, CGI, DB_File and other modules
5 2/9-11 Web crawling Ch 8 and class notes
6 2/16-18 More Web Crawling
7 2/23-25 Indexing (parsing, stopping, stemming, inverting) A1 due Mon Ch 2
10/20 STEPS Workshop: Vi
8 3/2-4 More indexing
9 3/9-11 Retrieval and ranking (VSM, similarity) A2 due Mon Ch 3
Ch 6 (6.1-6.2), Google's PageRank Explained, and class notes
3/16-18 Spring Break
10 3/23-25 More retrieval and ranking
Link analysis (PageRank)
11 3/30-4/1 More link analysis
12 4/6-8 Search engine evaluation A3 due Mon Ch 4
13 4/13-15 Search APIs Download Perl module for Yahoo Search API and get it to work! (You need an AppID from Yahoo Search Web Services.)
14 4/20-22 More evaluation A4 due Wed
15 4/27-29 Review, Q&A on project, discussion Final project due Wed Free Week
16 5/6 Final exam 5-7 pm in TBA