Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 

Information Integration

Description

Information integration is the merging of heterogeneous information from various data sources into a homogeneous, clean dataset. This lecture introduces this ever-important topic. It covers the basic technologies, such as distributed database architectures, techniques for virtual and materialized integration, and data cleansing technologies.

Further Information:

  • Lectures can be given in English on request.
  • Slides will be made available in the HPI-internal materials folder.
  • The lectures will be recorded by tele-task.
  • The exercises are led by Tim Repke.

Schedule

The course takes place on Mondays and Thursdays at 9:15 AM in HS 2. Some sessions will be held as exercises.

Date Topic
MO 14.10. No Lecture - HPI Plenary Meeting
TH 17.10. Introduction
MO 21.10.  
TH 24.10.  
MO 28.10.  
Reformation Day  
MO 04.11.  
TH 07.11.  
MO 11.11.  
TH 14.11.  
MO 18.11.  
TH 21.11.  
MO 25.11.  
TH 28.11.  
MO 02.12.  
TH 05.12.  
MO 09.12.  
TH 12.12.  
MO 16.12.  
TH 19.12.  
Christmas break  
MO 06.01.  
TH 09.01.  
MO 13.01.  
TH 16.01.  
MO 20.01.  
TH 23.01.  
MO 27.01.  
TH 30.01.  
MO 03.02.  
TH 06.02.  
tbd Written exam in HS 1

Literature

  • Ulf Leser and Felix Naumann: Informationsintegration, dpunkt Verlag, 2006 (free PDF).
    This book is available at the UP library and also, e.g., from Amazon.de.
  • Doan, Halevy, and Ives: Principles of data integration, Morgan Kaufmann, 2012.
  • Özsu and Valduriez: Principles of distributed database systems, Springer, 2011.
  • Stefan Conrad: Föderierte Datenbanksysteme, Springer, 1997.

Throughout the lecture, I will refer to various scientific papers that serve as in-depth references.

Exam

A written exam will take place on XXX in YYY.