Dr. Ioannis Koumarelas

I am a former Ph.D. student at the Information Systems Research Group. My research began in collaboration with SAP and SAP Concur. During my Ph.D. I worked in the general areas of Data Cleaning and Data Preparation, with a main focus on Duplicate Detection.

Hasso-Plattner-Institut
für Softwaresystemtechnik
Prof.-Dr.-Helmert-Straße 2-3
D-14482 Potsdam
Office: F-2.05, Campus II

Phone: +49 331 5509 1377
Email: Ioannis Koumarelas
Research: GoogleScholar, ResearchGate, DBLP
Profiles: LinkedIn, GitHub

Research Interests

Duplicate Detection (Record Linkage, Entity Resolution, etc.), Data Cleaning, Data Preparation
Address Geocoding
Parallel and Distributed Systems, Big Data Management
Data Profiling
Data Mining, Machine Learning, Deep Learning

Projects

Cooperation project with SAP and SAP Concur on Vendor Data Cleaning for hotels. Our main task has been to apply Duplicate Detection, that is, to identify duplicates and understand their causes. Our approaches mainly rely on data preparation and matching dependencies; more information is available in our publications.

Teaching

Publication list

2021

Loster, M., Koumarelas, I., Naumann, F.: Knowledge Transfer for Entity Resolution with Siamese Neural Networks. Journal of Data and Information Quality. 13, (2021).

2020

Koumarelas, I., Jiang, L., Naumann, F.: Data Preparation for Duplicate Detection. Journal of Data and Information Quality (JDIQ). 12, 1–24 (2020).

Schirmer, P., Papenbrock, T., Koumarelas, I., Naumann, F.: Efficient Discovery of Matching Dependencies. ACM Transactions on Database Systems (TODS). 45, 1–33 (2020).

Koumarelas, I., Papenbrock, T., Naumann, F.: MDedup: Duplicate Detection with Matching Dependencies. Proceedings of the VLDB Endowment (PVLDB). 13, 712–725 (2020).

2018

Koumarelas, I., Kroschk, A., Mosley, C., Naumann, F.: Experience: Enhancing Address Matching with Geocoding and Similarity Measure Selection. Journal of Data and Information Quality (JDIQ). 10, 8:1–8:16 (2018).

Pietrangelo, A., Simonini, G., Bergamaschi, S., Naumann, F., Koumarelas, I.: Towards Progressive Search-driven Entity Resolution. Italian Symposium on Advanced Database Systems (SEBD) (2018).

2016

Samiei, A., Koumarelas, I., Loster, M., Naumann, F.: Combination of Rule-based and Textual Similarity Approaches to Match Financial Entities. Data Science for Macro-Modeling with Financial and Economic Datasets (DSMM). ACM (2016).
