CSCI B656: Web Mining

Description	Machine learning techniques to mine the Web and other information networks, social networks, and social media. Crawling, indexing, ranking and filtering algorithms using content and link analysis. Applications to search, classification, recommendation, and Web intelligence. Group project on one of the topics covered in class.
Prerequisites	This course is open to CS, Informatics, SLIS, CogSci, and other graduate students with an interest in information systems, artificial intelligence, and Web science. Although prior exposure to machine learning algorithms, information retrieval, and/or Web programming is helpful, there are no advanced AI or DB prerequisites. Strong coding skills (in any language) are highly recommended.
Textbook	Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data by Bing Liu (with a chapter on crawling by yours truly, slides here), Springer, 2007. The second edition came out in 2011; either edition is fine for this course. Another excellent reference is Mining the Web by Soumen Chakrabarti, Morgan-Kaufmann, 2002, which we used in past offerings of this course. Note the second edition of this book is in the making.
Lecture	TR 2:30-3:45P in I (Informatics West) 107 (map)
Instructor	Fil Menczer (Office hours by appointment in Info East 314; please schedule in class)
AIs	Mohsen (Office hours Tu-Th 10am-noon)
Contact	Please use the group discussions for all class-related questions and communications, unless privacy is necessary.