Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 

Ioannis Koumarelas

I am a Ph.D. student in the Information Systems Research Group, and my research started in collaboration with SAP and SAP Concur. During my Ph.D. I have worked in the general area of Data Cleaning and Data Preparation, with my main focus on Duplicate Detection.

Hasso-Plattner-Institut
für Softwaresystemtechnik
Prof.-Dr.-Helmert-Straße 2-3
D-14482 Potsdam
Office: F-2.05, Campus II

Phone: +49 331 5509 1377
Email: Ioannis Koumarelas
Research: GoogleScholar, ResearchGate, DBLP
Profiles: LinkedIn, GitHub

Research Interests

  • Duplicate Detection (Record Linkage, Entity Resolution etc.), Data Cleaning, Data Preparation
  • Address Geocoding
  • Parallel and Distributed Systems, Big Data Management
  • Data Profiling
  • Data Mining, Machine Learning, Deep Learning

Projects

Cooperation project with SAP and SAP Concur on Vendor Data Cleaning for hotels. Our main task has been to apply Duplicate Detection, that is, to identify duplicates and understand their causes. The approaches we followed mainly use data preparation and matching dependencies; more information is available in our publications.

Publication list

Combination of Rule-based and Textual Similarity Approaches to Match Financial Entities

Samiei, Ahmad; Koumarelas, Ioannis; Loster, Michael; Naumann, Felix in ACM, 2016.

Record linkage is a well-studied problem with many years of publication history. Nevertheless, many challenges remain to be addressed, such as the topic of the FEIII Challenge 2016. Matching financial entities (FEs) is important for many private and governmental organizations. In this paper we describe the problem of matching such FEs across three datasets: FFIEC, LEI, and SEC.