| Description | Machine learning techniques to mine the Web and other information networks, social networks, and social media. Crawling, indexing, ranking and filtering algorithms using content and link analysis. Applications to search, classification, recommendation, and Web intelligence. Group project on one of the topics covered in class. |
|---|---|
| Prerequisites | This course is open to CS, Informatics, SLIS, CogSci, and other graduate students with an interest in information systems, artificial intelligence, and Web science. Although prior exposure to machine learning algorithms, information retrieval, and/or Web programming is helpful, there are no advanced AI or DB prerequisites. Strong coding skills (in any language) are highly recommended. |
| Textbook | Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data by Bing Liu (with a chapter on crawling by yours truly, slides here), Springer, 2007. The second edition came out in 2011; either edition is fine for this course. Another excellent reference is Mining the Web by Soumen Chakrabarti, Morgan-Kaufmann, 2002, which we used in past offerings of this course. Note the second edition of this book is in the making. |
| Lecture | TR 2:30-3:45P in I (Informatics West) 107 (map) |
| Instructor | Fil Menczer (Office hours by appointment in Info East 314; please schedule in class) |
| AIs | Mohsen (Office hours Tu-Th 10am-noon) |
| Contact | Please use the group discussions for all class-related questions and communications, unless privacy is necessary. |