Hasso-Plattner-InstitutSDG am HPI
Hasso-Plattner-InstitutDSG am HPI

Search Engine Implementation (Sommersemester 2014)

Dozent: Prof. Dr. Felix Naumann (Information Systems) , Prof. Dr. Ralf Krestel (Information Systems)

Allgemeine Information

  • Semesterwochenstunden: 2
  • ECTS: 3
  • Benotet: Ja
  • Einschreibefrist: 1.4.2014 - 28.4.2014
  • Lehrform: Seminar
  • Belegungsart: Wahlpflichtmodul

Studiengänge, Modulgruppen & Module

IT-Systems Engineering BA


In this seminar groups of students will design and

implement a search engine on top of the German Wikipedia. Based on the

information retrieval lecture, different ranking algorithms should be

implemented, such as BM25 or language models. The search engine should also

support boolean search, prefix search, and retrieve linked pages. To present

the search result rankings, a snippet generator algorithm has to be developed

and finally we will compare the search results with Wikipedia's search engine

results for a set of test queries to find the best implementation.




This seminar is for Master students who also participate in the Information Retrieval lecture.


This seminar is based on this book available in the university library.

W. Bruce Croft, Donald Metzler, and Trevor Strohman: Search Engines: Information Retrieval in Practice

Lern- und Lehrformen

In this seminar teams will implement a search engine. Project management, team organization, and programming in teams will be the focus.


Presentation of the implemented search engine beginning

of October. Exact date tba.