Research and Implementation of Database Concepts

General Information

Teaching staff: Thomas Bodner, Dr. Daniel Ritter, Dr. Michael Perscheid, Dr. Rainer Schlosser
4 semester hours per week (Semesterwochenstunden, SWS) - 6 ECTS (graded)
First meeting: 25 October 2021
Room: Online via Zoom (Passcode: 00977962) and A1.2 (changed!) - choose whatever suits you
Time: Monday 15:15 (applies only to the first meeting; afterwards, appointments are scheduled with the project supervisor)
Enrollment: 1 October until 22 October 2021
Exam date: no exam, see schedule below
Specialization areas:
- ITSE: BPET, OSIS, ITSE-Analyse, ITSE-Maintenance
- DATA: Scalable Data Systems
Slides of introductory meeting

About this Seminar

Our database research seminar invites students who are interested in working on research-related topics in the area of database systems and, in particular, on our research database systems Hyrise and Skyrise. An introduction is given in the Hyrise and Skyrise research papers and the open-source Hyrise repository.

Logistics

In the first meeting, we will introduce the instructors and present the different topics.
The first meeting will be held online.
Submit the topics that interest you by October 31, 2021. Topic assignments will be announced on November 1, 2021. (Details will be discussed in the first meeting.)
Subsequent meetings will be held in the individual groups. Depending on your and your instructor's preferences, these can take place online or in person.

Example Topics

This list of topics is not exhaustive and we are happy to discuss research projects based on your previous experience and personal interests.

In-Memory Pipelined Query Execution: The pipelined query execution model passes intermediate results between query operators one tuple or one batch at a time rather than in their entirety. This reduces the memory footprint and enables parallelism along pipelines of operators. In this work, we remodel the query execution within the FaaS-based Skyrise workers to pipeline intermediates.
Analyzing Traces of Serverless Query Execution: An inherent issue of serverless software systems is the limited observability of their inner mechanics, which makes debugging and profiling cumbersome. Skyrise has a monitoring subsystem that generates a myriad of logs, metrics, and traces per query executed in parallel on hundreds to thousands of workers. This topic is about effectively and efficiently analyzing these artifacts to help database developers better understand serverless query execution.
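
To illustrate the pipelined execution model mentioned in the first topic, the following is a minimal sketch in Python. It is not taken from Hyrise or Skyrise; the operator names (scan, select, project), the dictionary-based row format, and the batch size are assumptions chosen for illustration. Each operator is a generator that consumes and produces one batch at a time, so no operator materializes its full output.

```python
from typing import Callable, Dict, Iterable, Iterator, List

# Hypothetical row and batch types, used only for this illustration.
Row = Dict[str, int]
Batch = List[Row]


def scan(rows: Iterable[Row], batch_size: int) -> Iterator[Batch]:
    """Emit the input in batches of at most batch_size rows."""
    batch: Batch = []
    for row in rows:
        batch.append(row)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # emit the final, possibly smaller batch
        yield batch


def select(batches: Iterable[Batch], predicate: Callable[[Row], bool]) -> Iterator[Batch]:
    """Filter each incoming batch and pass on non-empty results."""
    for batch in batches:
        kept = [row for row in batch if predicate(row)]
        if kept:
            yield kept


def project(batches: Iterable[Batch], columns: List[str]) -> Iterator[Batch]:
    """Keep only the requested columns of each row in each batch."""
    for batch in batches:
        yield [{column: row[column] for column in columns} for row in batch]


def run_pipeline(rows: Iterable[Row], batch_size: int = 2) -> List[Row]:
    """Chain the operators; batches flow through lazily, one at a time."""
    pipeline = project(
        select(scan(rows, batch_size), lambda row: row["price"] > 10),
        ["id"],
    )
    return [row for batch in pipeline for row in batch]
```

Calling run_pipeline on a list of rows with "id" and "price" fields returns the ids of all rows whose price exceeds 10. Only the final result is collected into a list; between operators, at most one batch is held at a time.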

Learning Goals

Participants will deepen their understanding of data management technologies and improve their systems development skills by working with a large existing code base. Additionally, they will gain experience in the scientific method and scientific writing, which will serve as preparation for their upcoming master's theses.

Seminar Schedule

Topics: During the first week of the lecture period, potential topics will be presented by the supervisors and chosen by the participants. The topics can be worked on alone or in groups of two.
Familiarization: The participants are expected to familiarize themselves with the chosen topic and study recent publications that are provided by the supervisors.
Project: Afterwards, participants will carry out implementations and evaluations with guidance from the supervisors.
Final Presentations: Presentations of approximately 30 minutes (~20 min. presentation + 10 min. Q&A) will be held after the end of the lecture period on February 28, 2022 (expected).
Scientific Report: In the end, a scientific report (4-8 pages, depending on the group size, in ACM format) should put the targeted problem into context (challenges, motivation, and related work), document the approach taken, and present evaluations and lessons learned to answer the research questions raised. The expected date for the final report is March 20, 2022.

Prerequisites

Good knowledge of C++ and/or Python
Basic knowledge of database systems (e.g., DBS or TuK I lectures)
Prior attendance of the Develop Your Own Database seminar is beneficial but not required

Grading

50% project result and presentation
40% scientific report
10% personal engagement
