Hasso-Plattner-Institut: 25 Years of HPI

Explainable Data Matching (Summer Semester 2022)

Lecturer: Prof. Dr. Felix Naumann (Information Systems)
Course website: https://hpi.de/naumann/teaching/current-courses/ss-22/explainable-data-matching.html

General Information

  • Weekly hours per semester: 4
  • ECTS: 6
  • Graded: Yes
  • Enrollment period: 1 April 2022 - 30 April 2022
  • Course format: Seminar
  • Enrollment type: Compulsory elective module
  • Language of instruction: English
  • Maximum number of participants: 6

Degree Programs, Module Groups & Modules

IT-Systems Engineering MA
Data Engineering MA
  • PREP: Data Preparation
    • HPI-PREP-K Concepts and Methods
  • PREP: Data Preparation
    • HPI-PREP-T Techniques and Tools
  • PREP: Data Preparation
    • HPI-PREP-S Specialization

Description

Data matching is the process of detecting (and subsequently cleaning) multiple representations of the same real-world object within a given dataset. Typical approaches create a candidate set of record pairs, compute a similarity score for each pair, and compare that score to a threshold to decide whether the pair is a match. Such data matching systems and their components can be quite complex, which makes their results difficult to understand. Building upon the data matching benchmark platform Frost and its implementation Snowman, we plan to develop methods that better explain data matching results to developers and domain experts.
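
The three steps named above (candidate generation, similarity computation, threshold comparison) can be illustrated with a minimal Python sketch. It is not taken from the course material and does not use Frost or Snowman. The toy records, the city-based blocking key, the choice of `difflib.SequenceMatcher` as similarity measure, and the threshold of 0.8 are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical toy records; not taken from the course material.
records = {
    1: "Felix Naumann, Potsdam",
    2: "F. Naumann, Potsdam",
    3: "Anna Schmidt, Berlin",
    4: "Anna Schmitt, Berlin",
    5: "Jonas Weber, Hamburg",
}


def blocking_key(value: str) -> str:
    """Assumed blocking key: the text after the last comma, lower-cased."""
    return value.rsplit(",", 1)[-1].strip().lower()


def candidate_pairs(recs: dict) -> list:
    """Step 1: pair only those records that share a blocking key."""
    blocks = {}
    for rid, value in recs.items():
        blocks.setdefault(blocking_key(value), []).append(rid)
    pairs = []
    for ids in blocks.values():
        pairs.extend(combinations(sorted(ids), 2))
    return sorted(pairs)


def similarity(a: str, b: str) -> float:
    """Step 2: character-based similarity in the range [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match(recs: dict, threshold: float = 0.8) -> list:
    """Step 3: compare each candidate pair's similarity to the threshold.

    Returns (left_id, right_id, score, is_match) tuples so that every
    decision can be traced back to the score that produced it.
    """
    results = []
    for left, right in candidate_pairs(recs):
        score = similarity(recs[left], recs[right])
        results.append((left, right, score, score >= threshold))
    return results


if __name__ == "__main__":
    for left, right, score, is_match in match(records):
        print(f"{left} vs {right}: score={score:.2f} match={is_match}")
```

Returning the score alongside each decision is a deliberately simple design choice: it shows why a pair was or was not classified as a match, which is the kind of question an explanation method would need to answer for far more complex systems.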

Prerequisites

Foundations and experience in data cleaning and data matching

Literature

Learning and Teaching Formats

Project seminar with weekly meetings, presentations and discussions

Assessment

Presentation and written report

Dates

Please see the course website.
