Hasso-Plattner-Institut
  
Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 

Similarity Search

Similarity search refers to the task of finding objects that are similar to a given query in a set of objects. Common DBMS only provide means to efficiently find exact matches to a given query. In case of typing errors, omitted or transposed attribute values or other typical data quality problems in queries, exact search algorithms fail to find all relevant objects in the queried data set.

In this project, we survey existing and develop new algorithms for effective and efficient similarity search. Effective similarity search can be achieved by defining a similarity measure that is well-suited for the given domain. For efficient similarity search, an index structure is required that precomputes similarities of objects to answer queries as fast as possible.

This project is supported by SCHUFA Holding AG.

Project members:

Master's theses:

  • Matthias Pohl: Automatisierte Konfiguration des D-Index zur Ähnlichkeitssuche, 2011
  • Dandy Fenz: Effiziente Ähnlichkeitssuche in einer großen Menge von Zeichenketten mittels Key-Value-Store, 2011

Publications

1.
Dustin Lange, Felix Naumann
Information Systems (IS), vol. 38(4):455–469 2013
2.
Dustin Lange, Felix Naumann
In Proceedings of the International Conference on Scientific and Statistical Database Management (SSDBM), Baltimore, Maryland, 2013
3.
Dandy Fenz, Dustin Lange, Astrid Rheinländer, Felix Naumann, Ulf Leser
In Proceedings of the International Conference on Scientific and Statistical Database Management (SSDBM), Chania, Crete, Greece, 2012
4.
Martin Köppelmann, Dustin Lange, Claudia Lehmann, Marika Marszalkowski, Felix Naumann, Peter Retzlaff, Sebastian Stange, Lea Voget
In Proceedings of the 6th International Workshop on Ranking in Databases (DBRank) in conjunction with VLDB, Istanbul, Turkey, 2012
5.
Dustin Lange, Tobias Vogel, Uwe Draisbach, Felix Naumann
Datenbank-Spektrum, vol. 11(1):51-57 2011
6.
Dustin Lange, Felix Naumann
In Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM), pages 1679–1688, Glasgow, Scotland, UK, 2011
7.
Dustin Lange, Felix Naumann
In Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM), pages 243–248, Glasgow, Scotland, UK, 2011