Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
 

Dr. Dustin Lange

former member

Research interests

  • Similarity Search
  • Similarity Measures
  • Search Engines
  • Machine Learning

Projects

Publications

  • Reach for Gold: An Annealing Standard to Evaluate Duplicate Detection Results. Vogel, Tobias; Heise, Arvid; Draisbach, Uwe; Lange, Dustin; Naumann, Felix in JDIQ (2014). 5(1-2)
     
  • Cross-lingual Entity Matching and Infobox Alignment in Wikipedia. Rinser, Daniel; Lange, Dustin; Naumann, Felix in Information Systems (IS) (2013). 38(6) 887–907.
     
  • Bulk Sorted Access for Efficient Top-k Retrieval. Lange, Dustin; Naumann, Felix (2013).
     
  • Cost-Aware Query Planning for Similarity Search. Lange, Dustin; Naumann, Felix in Information Systems (IS) (2013). 38(4) 455–469.
     
  • Efficient Similarity Search in Very Large String Sets. Fenz, Dandy; Lange, Dustin; Rheinländer, Astrid; Naumann, Felix; Leser, Ulf (2012).
     
  • Scalable Similarity Search with Dynamic Similarity Measures. Köppelmann, Martin; Lange, Dustin; Lehmann, Claudia; Marszalkowski, Marika; Naumann, Felix; Retzlaff, Peter; Stange, Sebastian; Voget, Lea (2012).
     
  • Projektseminar "Similarity Search Algorithms". Lange, Dustin; Vogel, Tobias; Draisbach, Uwe; Naumann, Felix in Datenbank-Spektrum (2011). 11(1) 51–57.
     
  • Efficient Similarity Search: Arbitrary Similarity Measures, Arbitrary Composition. Lange, Dustin; Naumann, Felix (2011). 1679–1688.
     
  • Frequency-aware Similarity Measures. Lange, Dustin; Naumann, Felix (2011). 243–248.
     
  • Extracting structured information from Wikipedia articles to populate infoboxes. Lange, Dustin; Böhm, Christoph; Naumann, Felix (2010). 1661–1664.
     
  • Extracting structured information from Wikipedia articles to populate infoboxes. Technical Report (38), Lange, Dustin; Böhm, Christoph; Naumann, Felix (2010).
     

Master's Thesis Supervision

  • Matthias Pohl: Automatisierte Konfiguration des D-Index zur Ähnlichkeitssuche, 2011
  • Dandy Fenz: Effiziente Ähnlichkeitssuche in einer großen Menge von Zeichenketten mittels Key-Value-Store, 2011
  • Daniel Rinser: Wikipedia Cross-lingual Concept Identification and Infobox Alignment, 2010

Project Supervision

Teaching

Memberships