Course Archive - Information Systems Group
Below is a list of past courses given by the Information Systems group.
Summer 2025
- Datenbanksysteme I (VL, BSc)
Felix Naumann, Youri Kaminsky - Data Integration (VL, MSc)
Felix Naumann, Sedir Mohammed - The Error Games: A Data Quality Challenge (PS, MSc)
Felix Naumann, Lisa Ehrlinger, Divya Bhadauria - Research Methods (SE, MSc)
Felix Naumann - Develop Your Own Database (PS, MSc)
Martin Boissier, Daniel Lindner, Florian Schmeller, Felix Naumann, Tilmann Rabl - The Early Bird: Upstream Change Detection for ML Pipelines (MP, MSc)
Felix Naumann, Lisa Ehrlinger, Sedir Mohammed
Winter 2024/2025
(Prof. Felix Naumann is on sabbatical in this semester)
- Datenbanksysteme II (VL, BSc)
Lisa Ehrlinger, Youri Kaminsky - Advanced Data Profiling (PS, MSc)
Sebastian Schmidl, Youri Kaminsky, Daniel Lindner, Felix Naumann - DQ4AI: Data Quality Assessment (PS, MSc)
Lisa Ehrlinger, Sedir Mohammed, Felix Naumann - Table Representation Learning (PS, MSc)
Francesco Pugnaloni, Lukas Laskowski, Felix Naumann
Summer 2024
- Datenbanksysteme I (VL, BSc)
Felix Naumann, Youri Kaminsky - Data Integration (VL, MSc)
Felix Naumann, Sebastian Schmidl - Raubkunst: Linking artworks, artists, and owners (Bachelorprojekt, pdf)
Felix Naumann, Sedir Mohammed in cooperation with JDCRP - DBStrange: Exploring the Multiverse of Entity Resolution Datasets (Master's project)
Fabian Panse, Lukas Laskowski, Felix Naumann
Winter 2023/24
- Datenbanksysteme II (VL, BSc)
Felix Naumann, Youri Kaminsky (Übungen) - Advanced Data Profiling (PS, MSc)
Sebastian Schmidl, Youri Kaminsky, Daniel Lindner, Felix Naumann - Data Cleaning and Integration (S, MSc)
Fabian Panse, Matteo Paganelli, Felix Naumann - Methoden der Forschung (S, MSc)
Felix Naumann - Lecture Series on Database Research (Lecture Series, BSc + MSc)
Tilmann Rabl, Felix Naumann - Raubkunst: Linking artworks, artists, and owners (Bachelorprojekt, pdf)
Felix Naumann, Sedir Mohammed in cooperation with JDCRP - Constraint-based Schema Matching (Masterprojekt)
Fabian Panse, Matteo Paganelli, Felix Naumann
Summer 2023
- Datenbanksysteme I (VL, BSc, English)
Hazar Harmouch, Leon Bornemann (Übungen) - Data Profiling (VL, MSc, English)
Felix Naumann, Youri Kaminsky (Übungen) - Data Quality for AI (PS, MSc)
Hazar Harmouch, Sedir Mohammed - Develop Your Own Database (PS, MSc)
Daniel Lindner, Marcel Weisgut, Martin Boissier, Stefan Halfpap, Thomas Bodner - Bachelorprojekt: Ein Feinkost Data Warehouse (mit Robert Lindner GmbH - https://www.lindner-esskultur.de)
Felix Naumann, Leon Bornemann
Winter 2022/23
- Datenbanksysteme II (VL, BSc)
Felix Naumann, Youri Kaminsky (Übungen) - Tagging and Captioning Art-Historical Photographs (PS, MSc)
Alejandro Sierra Múnera, Hendrik Rätz, Jona Otholt - Methoden der Forschung (S, MSc)
Felix Naumann - Approximate Data Profiling (PS, MSc)
Tobias Bleifuß, Youri Kaminsky, Felix Naumann - Bachelorprojekt: Ein Feinkost Data Warehouse (mit Robert Lindner GmbH - https://www.lindner-esskultur.de)
Felix Naumann - Master project: CAST: Classifying Time Series Anomalies
Sebastian Schmidl, Philipp Wenig
Summer 2022
- Datenbanksysteme I (VL, BSc)
Felix Naumann, Leon Bornemann (Übungen) - Information Integration (VL, MSc)
Felix Naumann, Tobias Bleifuß (Übungen) - Explainable Data Matching (PS, MSc)
Felix Naumann - Methoden der Forschung (S, MSc)
Felix Naumann - Knowledge Graphs meet Language Models (S, MSc)
Nitisha Jain and Alejandro Sierra-Múnera - BCNF*: Automatische Schemaanalyse und Datentransformation für Data Warehouses (Bachelorprojekt, in Kooperation mit Schweizerische Bundesbahnen SBB Cargo)
Felix Naumann, Youri Kaminsky - Music Walks (Master's project with Museum Barberini and Henrik Schwarz)
Alejandro Sierra-Múnera
Winter Semester 2021/22
- Datenbanksysteme II (VL, Bachelor, in Präsenz - online Teilnahme möglich)
Felix Naumann, Leon Bornemann (Übungen) - Lecture Series on Database Research (Ringvorlesung BA & MA - online Teilnahme möglich)
Tilmann Rabl, Felix Naumann - Methoden der Forschung (S, Master, in Präsenz)
Felix Naumann - Large-Scale Time Series Analytics (PS, Master, in Präsenz)
Sebastian Schmidl, Phillip Wenig, Felix Naumann - Data Quality for AI (PS, Master, in Präsenz)
Hazar Harmouch, Felix Naumann - BCNF*: Automatische Schemaanalyse und Datentransformation für Data Warehouses (Bachelorprojekt, in Kooperation mit Schweizerische Bundesbahn SBB Cargo)
Youri Kaminsky, Felix Naumann - Wikipedia Cleanup: Recognizing Stale Data (Masterprojekt)
Tobias Bleifuß, Leon Bornemann, Felix Naumann
Summer semester 2021
- Datenbanksysteme I (VL, Bachelor)
Felix Naumann, Leon Bornemann - Einführung in die Programmiertechnik II (VL, Bachelor)
Felix Naumann, Tobias Bleifuß - Distributed Data Management (VL, Master)
Thorsten Papenbrock, - Methoden der Forschung (SE, Master)
Felix Naumann - Table Recognition (PS, Master)
Gerardo Vitagliano, Felix Naumann - Building Machine Learning Applications (PS, Master)
Alexander Albrecht, Thorsten Papenbrock - Knowledge Graphs (SE, Master)
Ralf Krestel - UltraMine: Skalierbare Analyse von Messdatenströmen (Bachelorprojekt)
Thorsten Papenbrock, Phillip Wenig, Sebastian Schmidl - Data Matching Benchmark (Bachelorprojekt)
Felix Naumann - Generating Art using GANs (Masterprojekt)
Ralf Krestel, Gerard de Melo, Alejandro Sierra
Winter semester 2020/21
- Datenbanksysteme II (VL, Bachelor)
- UltraMine: Skalierbare Analyse von Messdatenströmen (Bachelorprojekt)
- Data Matching Benchmark (Bachelorprojekt)
- Data Profiling (VL, Master)
- Deep Learning for Text Mining (VL, Master)
- Practical Introduction to Deep Learning for Computer Vision (S, Master)
- Discovering Change Dependencies (Masterprojekt)
Summersemester 2020
(Prof. Felix Naumann was on sabbatical in this semester)
- Information Retrieval (VL, Bachelor)
- Distributed Data Management (VL Master)
- Museumserlebnisse mit Datenanalyse optimieren (Bachelorprojekt)
- Multimodal Analysis of Cultural Data (Masterprojekt)
- Sustainable Machine Learning on Edge Device Clusters (PS, Master)
- Solving the Climate Crisis with Text Mining (PS, Master)
- Truth Discovery Algorithms (S, Master)
Wintersemester 2019 / 2020
- Datenbanksysteme II (VL, Bachelor)
- Information Integration (VL, Master)
- Distributed Data Management (VL, Master)
- Data Engineering in Practice (VL, Bachelor/Master)
- Paint it Black: Ethik in der Datenanalyse (S, Bachelor)
- Paint it Black: Ethical Data Analytics (S, Master)
- Machine Learning for Data Streams (S, Master)
- Genealogy of Natural Language (Masterproject)
- Data Analytics – Museumserlebnisse mit Datenanalyse optimieren (Bachelorprojekt)
- In Kooperation mit dem Museum Barberini
Summer 2019
- Datenbanksysteme I (VL, Bachelor)
- Programmiertechnik II (VL, Bachelor)
- Unit Testing Data for Machine Learning (pdf, Bachelorprojekt)
- Processing Web Tables (PS, Master)
- Text Visualization (PS, Master)
- Methoden der Forschung (S, Master)
- Reliable Distributed Systems Engineering (PS, Master)
- Mining Streaming Data (PS, Master)
- What's in a life (Master project)
Winter 2018/2019
- Datenbanksysteme II (VL, Bachelor)
- Unit Testing Data for Machine Learning (pdf, Bachelorprojekt)
- Distributed Data Management (VL, Master)
- Deep Learning für Text Mining (VL, Master)
- Methoden der Forschung (S, Master)
- Data Preparation for Science (PS, Master)
- Recommender Systems (S, Master)
- Social Network Analysis in Practice (PS, Master)
Summer 2018
- Datenbanksysteme I (VL, Bachelor)
- Beautiful Data (SE, Bachelor)
- Horrible Data - Data Quality and Data Cleansing (SE, Master)
- Actor Database Systems (SE, Master)
- Text Mining in Practice (PS, Master)
- Methoden der Forschung (SE, Master)
- Vandalism Detection in Wikipedia Table Revisions (Master project)
- Leuchtturm im Datennebel: Exploration großer Dokumentensammlungen (Bachelorprojekt)
- Inventory Management - Die Vermessung des deutschen E-Commerce (Bachelorprojekt)
Winter 2017/18
- Datenbanksysteme II (VL, Bachelor)
- Data Engineering in der Praxis - Ringvorlesung (VL, Bachelor/Master)
- Information Retrieval and Web Search (VL, Master)
- Distributed Data Analytics (VL, Master)
- Information Integration (VL, Master)
- Probabilistic Graphical Models (SE, Master)
- Advanced Data Profiling (PS, Master)
- Building Scalable Blockchain Applications with Big Data Technology (PS, Master)
- Leuchtturm im Datennebel: Exploration großer Dokumentensammlungen (Bachelorprojekt)
- Inventory Management - Die Vermessung des deutschen E-Commerce (Bachelorprojekt)
Summer 2017
- Datenbanksysteme I (VL, Bachelor)
- Text Mining (S, Bachelor)
- Data Profiling (VL, Master)
- Recommender Systems (S, Master)
- Ingestion: Datenaufnahme und -analyse für Semantische Unternehmensnetzwerke (Bachelorprojekt)
- Hate Speech Detection (Master project)
Winter 2016/2017
- Data Mining and Probabilistic Reasoning (VL, Master)
- Profiling Dynamic Data (Master Project)
- Distributed Duplicate Detection (Projectseminar, Master)
- Ingestion: Datenaufnahme und -analyse für Semantische Unternehmensnetzwerke (Bachelorprojekt)
Prof. Naumann was on sabbatical leave in this semester.
Summer 2016
- Einführung in die Programmiertechnik II (VL, Bachelor)
- Datenbanksysteme I (VL, Bachelor)
- Bachelorprojekte
- Analysewerkzeuge für semantische Unternehmensnetzwerke
- Data Refinery: High-Performance-Datenaufbereitung für den idealo-Preisvergleich
- Advanced Topic Modeling (SE, Master)
- Mining Massive Datasets (SE, Master)
- Incremental Duplicate Detection (SE, Master)
- TIF
Winter 2015/2016
- Datenbanksysteme II (Vorlesung, Bachelor)
- Information Integration (Lecture, Master)
- Information Retrieval and Web Search (Lecture, Master)
- Bachelorprojekte
- Analysewerkzeuge für semantische Unternehmensnetzwerke
- Data Refinery: High-Performance-Datenaufbereitung für den idealo-Preisvergleich
- Masterproject: Learning to Note - Intelligent Support for Document Annotation using Semi-Supervised Learning
- Research Seminar and InfoLunch
Summer 2015
- Datenbanksysteme I (Vorlesung, Bachelor)
- Data Mining and Probabilistic Reasoning (Lecture, Master)
- Distributed Big Data Analytics (Projectseminar, Master)
- Bachelorprojekte
- Masterprojekt: Approximate Data Profiling
- Research Seminar and InfoLunch
Winter 2014/15
- Datenbanksysteme II (Vorlesung, Bachelor)
- Proseminar Informationssysteme (Seminar, Bachelor)
- Data Profiling and Data Cleansing (Lecture, Master)
- Recommender Systems (Seminar, Master)
- Bachelorprojekte
- Masterprojekt: Metadata Trawling
- Research Seminar and InfoLunch
Summer 2014
- Einführung in die Programmiertechnik II (Vorlesung, Bachelor)
- Datenbanksysteme I (Vorlesung, Bachelor)
- Information Retrieval (Lecture, Master)
- Search Engine Implementation (Project Seminar, Master)
- Natural Language Processing (Lecture, Master)
- Bachelor projects
- Master project: Finding Relevant Tweets for News
Results published as WWW 2015 poster - Research Seminar and InfoLunch
Winter 2013/14
- Datenbanksysteme II (Vorlesung, Bachelor)
- Beautiful Data (Seminar, Bachelor)
- Data Mining and Probabilistic Reasoning (Vorlesung, Master)
- Advanced Data Profiling (Project/Seminar, Master)
Results published as VLDB 2015 E&A paper - Master projects
- Piggyback Profiling: Metadata for Query Results
- Joint Data Profiling
- Bachelor projects
- Forschungsseminar and InfoLunch
Summer 2013
- OpenHPI Onlinekurs "Datenmanagement mit SQL"
- Datenbanksysteme I (Vorlesung, Bachelor)
- Data Profiling and Data Cleansing (Vorlesung, Master)
- Large Scale Duplicate Detection (Projectseminar, Master)
- Advanced Recommendation Techniques (Projectseminar, Master)
- VIP 2.0: Celebrity Exploration (Bachelorproject)
- Forschungsseminar und InfoLunch
Winter 2012/13
- Information Retrieval (Lecture, Master)
- FactScore: Global Relevance Scores for DBpedia Facts (Masterproject)
- VIP 2.0: Celebrity Exploration (Bachelorproject)
- Forschungsseminar and InfoLunch
Prof. Naumann was on sabbatical leave in this semester.
Summer 2012
- Datenbanksysteme I (Vorlesung, Bachelor)
- Beauty is our Business (Seminar, Bachelor)
- 3 Master Lectures on Information Management: See this overview
- Natural Language Processing (Lecture, Master)
- Data Mining and Probabilistic Reasoning (Lecture, Master)
- Information Integration (Lecture, Master)
- Algorithms for Pattern Mining (Seminar, Master)
- Mainframe Computing Summit (Block course, Bachelor & Master)
- Bachelor's projects
- Forschungsseminar & Infolunch
Winter 2011/2012
- Datenbanksysteme II (Vorlesung, Bachelor)
- Question Answering: Who wants to be a millionaire? (Seminar, Master)
- Scalable Data Analysis Algorithms (Seminar, Master)
- Bachelor's projects
- LuSim: Similarity Search with Apache Lucene (Masterprojekt)
Results published at DBRank Workshop 2012 - Workshop Datenreinigung (10. - 12. Oktober)
für Bachelor-, Master-, und PhD-Studenten - Forschungsseminar & Infolunch
Summer 2011
- Datenbanksysteme I (Vorlesung, Bachelor)
- Übung zur Vorlesung
- Beauty is our Business (Seminar, Bachelor)
- NoSQL (Seminar, Bachelor)
- Search Engines (Vorlesung, Master)
- Collaborative Filtering (Seminar, Master)
- Master's project: Duplikaterkennung auf GPUs
Results published at BTW Conference 2013 - Bachelor's projects:
- Forschungsseminar
Lehrangebot Winter 2010/2011
- Datenbanksysteme II (Vorlesung, Bachelor)
- Übung zur Vorlesung
- Mobile Computing with Android (Projektseminar, Bachelor)
- Duplicate Detection (Seminar, Master)
- BlackSwan: Automated annotation of global statistics (Projektseminar, Master)
Results published at CIKM 2011 - Bachelorprojekte
- ProCSIA: Profiling Column Stores with IBM’s Information Analyzer
- Faceted Search – Selbstkonfigurierende Produktsuchmodelle
- Workshop: Datenreinigung - 13.10. - 15.10.
- Forschungsseminar
Lehrangebot Sommer 2010
- Datenbanksysteme I (Vorlesung, Bachelor)
- Übung zur Vorlesung
- Mobile Application Development (Projektseminar, Bachelor)
- Beauty is our Business (Seminar, Bachelor)
- Informationsintegration (Vorlesung, Master)
- Large-Scale Data Analysis on Cloud Platforms (Projektseminar, Master)
Results published in Billion Triple Challenge at ISWC 2010 - Similarity Search Algorithms (Projektseminar, Master)
Reported on in Datenbankspektrum 2011 - Entity-centric Information Retrieval (Projektseminar, Master)
- Bachelorprojekte
- ETL-Prozess-Management für BMW Financial Services
- Midas: Extreme Web Data Integration for Government Data
Extended to GovWILD.org; published at ISemantics 2010 and WWW 2012 (demo)
- Forschungsseminar
Lehrangebot Winter 2009/2010
- Datenbanksysteme II (Vorlesung und Übung, Bachelor)
- Beauty is our Business (Seminar, Bachelor)
- Advanced Map-Reduce-Algorithms on Hadoop (Projektseminar, Master)
- Emerging Web Services Technology (Seminar, Master)
- Workshop Datenreinigung (Bachelor und Master) 7.10. - 9.10.2009
- Bachelorprojekte
- ETL-Prozess-Management für BMW Financial Services
- Midas: Extreme Web Data Integration for Government Data
- Forschungsseminar
Lehrangebot Sommer 2009
- Datenbanksysteme I (Vorlesung und Übung)
- Übung zur Vorlesung
- Achtung: Für HPI Bachelorstudenten im 2. und 4. Semester
- Beauty is our Business (Proseminar, Bachelor)
- Map/Reduce Algorithms on Hadoop (Projektseminar, Bachelor)
- Search Engines (Vorlesung, Master)
- Übung zur Vorlesung
- Linked Data Profiling (Projektseminar, Master)
Published at NTII workshop 2010 - Bachelorprojekt: Optimierung regelbasierter Datentransformationen
- Forschungsseminar
Lehrangebot Winter 2008/2009
Prof. Naumann verbrachte in diesem Semester ein Forschungsfreisemester.
- Advanced Topics in Databases (Seminar, Bachelor)
- Information Retrieval (Seminar, Master)
- Bachelorprojekt: Optimierung regelbasierter Datentransformationen
- Forschungsseminar
Lehrangebot Sommer 2008
- Datenbanksysteme II (Vorlesung und Übung, Bachelor)
- Informationsintegration (Vorlesung und Übung, Master)
- Beauty is our Business (Seminar, Bachelor)
- www.ligageschichte.de (Projektseminar, Bachelor)
- Duplikaterkennung (Projektseminar, Master)
- FUZZY! Workshop - Datenreinigung (9.4. - 11.4.)
- 3-tägiger Intensiv-Workshop für Bachelor und Master
- Bachelorprojekt 1: HighQ - Informationsintegration mit dem IBM Information Server
- Bachelorprojekt 2: Datenfusion - Konsolidierung widersprüchlicher Daten
- Forschungsseminar
Lehrangebot Winter 2007/2008
- Datenbanksysteme I (Bachelorvorlesung)
- Beauty is our Business (Bachelorseminar)
- www.ProminentPeople.info (Bachelorseminar)
Extended to win the IEEE Services Cup 2008 - Schema Matching (Masterseminar)
- Forschungsseminar
- FUZZY! Workshop zur Datenreinigung (8.10. - 10.10.)
- Bachelorprojekt 01: HighQ - Informationsintegration mit dem IBM Information Server
- Bachelorprojekt 02: Datenfusion - Konsolidierung widersprüchlicher Daten