Prof. Dr. Felix Naumann


Search engines permeate every facet of our online lives and many facets of our offline lives. This lecture introduces the basic architectures and technologies of search engines, both on the Web and on other collections of digital artifacts. Topics covered include:

  • Search Engine Architectures
  • Crawling
  • Text Processing
  • Ranking Indexes
  • Search Queries
  • Information Retrieval Methods
  • Search Engine Evaluation


  • If you have any ideas for future exercises or comments on the exercise/lecture, please don't hesitate to contact us or use this web form.


  • Tuesdays, 9:15, HS3
  • Thursdays, 9:15, HS3

The lectures are given in English and are available as tele-task recordings for logged-in students.

Date Topic Slides
Tue 12.04.2011 Introduction pdf
Thu 14.04.2011 Architecture pdf
Tue 19.04.2011 Exercise 1: Nutch and Googlewhacking pdf, Nutch
Thu 21.04.2011 Crawling pdf
Tue 26.04.2011 Crawling
Thu 28.04.2011 Exercise 2: Crawling Journal Club pdf
Tue 03.05.2011 Text processing pdf
Thu 05.05.2011 Exercise 3: Fingerprints & Zipf pdf, Texts, Duplicates
Tue 10.05.2011 Text processing
Thu 12.05.2011 Indexing pdf
Tue 17.05.2011 Indexing
Thu 19.05.2011 Exercise 4: Text Processing pdf
Tue 24.05.2011 Indexing
Thu 26.05.2011 Querying pdf
Tue 31.05.2011 Cancelled
Thu 02.06.2011 Ascension Day (Christi Himmelfahrt), no lecture
Tue 07.06.2011 Querying
Thu 09.06.2011 Exercise 5: Querying pdf, MovieSearch
Tue 14.06.2011 Querying
Thu 16.06.2011 Cancelled: Students are encouraged to attend the FutureSOC Symposium
Tue 21.06.2011 Retrieval Models pdf
Thu 23.06.2011 Retrieval Models
Tue 28.06.2011 Moved to 06.07.
Thu 30.06.2011 Moved to 13.07.
Tue 05.07.2011 Exercise 6: Retrieval Models pdf
Wed 06.07.2011, 17:00 Retrieval Models
Thu 07.07.2011 Retrieval Models
Tue 12.07.2011, 10:00 Exercise 7
Wed 13.07.2011, 17:00 Evaluation
Thu 14.07.2011 Question Answering: Saeedeh Momtazi pdf
Tue 19.07.2011 Social Search pdf
Thu 21.07.2011 Outlook



The written exam will take place on 27.07.2011 (Wednesday) from 10:00 to 12:00 in HS 1.