Marcin Sydow - Web Information Retrieval / Web Mining Course

Studies: Graduate, Computer Science, Polish-Japanese Institute of Information Technology, spring/summer 2007
Lectures: 13 --- Classes/Labs: No --- Project: Optional --- Exam: Written/Oral

Description: An introduction to Web Information Retrieval and Web Mining, giving a broad overview of selected topics.
Language: Polish

Syllabus:
  1. Introduction - Web Mining, Web Information Retrieval, Information Age, Web and search: statistics, challenges, Search Engines - overview
  2. Basics of Information Retrieval, Boolean Queries, Ranking, Evaluation of IR
  3. Large-scale Crawling
  4. Indexing, Repository
  5. Large-scale infrastructure: hardware, architecture and software - case study: Google
  6. Power-Law, Statistical and Structural Properties of the Web Graph, Link Analysis - introduction
  7. Link-based ranking, PageRank, Topic-sensitive PageRank, computational issues
  8. HITS and variants (PHITS), applications to reputation systems in online auctions
  9. Selected topics in Web Content Mining
  10. The Economics of Web Search: Internet Advertising and Spam
  11. The Economics of Web Search (continued)
  12. (Class Work, date: 15 June 2007)
  13. Web Usage Mining
Last updated: 07 June 2007