We are excited to announce that the paper "PRISMA: A Privacy-Preserving Schema Matcher using Functional Dependencies" is accepted to be presented at the 28th International Conference on Extending Database Technology (EDBT) in 2025. We are very proud that the paper is a result of one of our master's projects.
Authors:
Jan-Eric Hellenberg (Hasso Plattner Institute)
Lukas Laskowski (Hasso Plattner Institute)
Fabian Mahling (Hasso Plattner Institute)
Felix Naumann (Hasso Plattner Institute)
Matteo Paganelli (University of Modena and Reggio Emilia)
Fabian Panse (Hasso Plattner Institute)
Abstract:
Schema matching is an essential step in many data integration processes and has been studied extensively in the literature. However, most matching approaches assume similarity of column names or instance data of the schemas to be matched and struggle when these are encoded differently.
We present PRISMA, a novel encoding-independent schema matching approach, which leverages functional dependencies to construct graph embeddings exploiting the encoding-independent structure of the schemas to be compared. We compare PRISMA against multiple baseline matchers as well as state-of-the-art competitors. The experiments demonstrate that PRISMA outperforms other approaches on databases that have large differences in their encodings, especially if these databases consist of multiple tables.