Our team is giving a series of lectures and seminars with a focus on enterprise systems design and in-memory data management. Strong links to the industry ensure a close connection between theory and its implementation in the real world.

If you are having questions regarding one of our publications, please contact the authors.

Dynamic Programming and Reinforcement Learning

General Information

Teaching staff: Dr. Rainer Schlosser, Alexander Kastius
6 ECTS (graded), 4 Semesterwochenstunden (SWS)
Lecture format: VL/UE
Enrollment time: 01.04.2022 - 30.04.2022
Time: Monday 15.15-16.45 and Thursday 13.30-15.00
Room: Mon L-1.02 + Thu L-1.06 (online via Zoom https://zoom.us/j/7271364393 Password: 256757)
Language: English/German
Program:
- IT-Systems Engineering: BPET, OSIS
- Data Engineering: DATA
- Digital Health: SCAD, DICR (see preconditions)
Project phase: individual appointments (in person/hybrid)
Final Presentations: August 8, 2022

Short Description

The need for automated decision-making is steadily increasing. Hence, data-driven decision-making techniques are essential. We assume a system that follows certain dynamics and has to be tuned or controlled over time such that certain constraints are satisfied and a specified objective is optimized. Typically, the current state of the system as well as the interplay of rewards and potential future states associated to certain actions have to be taken into account. The dynamics and state transitions may have to be estimated from data using suitable ML-based techniques.

As, in general, exact solution approaches of such dynamic optimization problems do not scale often heuristics have to be used (e.g., in case the number of states is too large, cf. curse of dimensionality). Besides classical approaches such as dynamic programming (DP) state-of-the-art heuristic optimization techniques such as approximate dynamic programming (ADP) or reinforcement learning (RL) are suitable alternatives.

Goals of the Course

Understand...

opportunities and challenges of decision-making
static deterministic problems
stochastic dynamic problems
optimization models and solution techniques

Do ...

work in small teams
set up suitable models, apply optimization techniques
simulate controlled processes, compare performance results

Improve/Learn ...

mathematical, analytical, and modelling skills
optimization techniques
dynamic programming methods
reinforcement learning methods

Preconditions

interest in quantitative methods and stochastics
programming skills/experience
the number of participants is not restricted

Teaching and Learning Process

The course is a combination of a lecture and a practical part:

teachers impart relevant knowledge and methods
students work on a self-containing topic in a team of ca. 3 people
students present and document their work

Grading

Portfolio assessment for ITSE, DE, and DH-students consisting of:

(i) final presentation of project results (July 18)
(ii) project documentation at the end of the module (Sep 15)

Material/Preparation

Slides and Upcoming Topics

1. Week: First Introduction (online) (April 21)
2. Week: Finite Time Markov Decision Processes (online) + Infinite Time MDPs (in person) (April 25/28)
3. Week: Approximate Dynamic Programming (ADP) + Implementation Exercise + (May 2/5)
4. Week: Q-Learning (QL) D-E.9/10 (May 12, not 9)
5. Week: Deep Q-Networks (DQN) + 2.Teil + DQN Extensions (May 16/19)
6. Week: Implementations & Open AI Gym (May 23, not 26)
7. Week: Policy Gradient Algorithms + Policy Gradient Algorithms 2 (May 30, June 2)
8. Week: Project Assignments (June 9, not 6)
9. Week: Project/Feedback (June 13/16)
10. Week: Project/Feedback (June 20/23)
11. Week: Project/Feedback (June 27/30)
12. Week: Project/Feedback (July 4/7)
13. Week: Project/Feedback (July 11/14)
14. Week: Final Presentations (July 18/21)
15. Week: Project/Feedback (July 25)
Documentations: Deadline September 15 (ca. 15 pages, e.g., LNCS)

Exercises:

DP Example (results)
QL Example (results)
Gym Exercises Cart Pole & Lunar Lander (see moodle)

Material:

News

22.09.2023 | Trends and Concepts in the Softwareindustry Seminar offered in WiSe 2023/2024

Trends and Concepts in the Softwareindustry Seminar offered in WiSe 2023/2024 > Zum Artikel

22.05.2023 | Christopher Hagedorn Successfully Defended His PhD Thesis

Christopher Hagedorn Successfully Defended His PhD Thesis > Zum Artikel

03.03.2023 | Last Trends and Concepts course of Prof. Hasso Plattner

After more than 20 years of teaching, our founder and benefactor Prof. Hasso Plattner visited the HPI this week for his … > Zum Artikel

01.03.2023 | Jan Kossmann Successfully Defended His PhD Thesis

Last week, Jan Kossmann another PhD student of our EPIC group successfully defended his thesis on the topic of … > Zum Artikel

26.02.2023 | Paper on Data Tiering in Hyrise Published in BTW Proceedings

Our latest paper on data tiering in Hyrise "Workload-Driven Data Placement for Tierless In-Memory Database Systems" by … > Zum Artikel

24.02.2023 | Paper on EPIC Research Group Published in SIGMOD Record

Our report “Enterprise Platform and Integration Concepts Research at HPI” has been published in the December issue of … > Zum Artikel

30.11.2022 | Paper on Database Optimizations for Spatio-Temporal Data published in PVLDB

Our paper “Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems” has been published in … > Zum Artikel

04.10.2022 | Günter Hesse Successfully Defended His PhD Thesis

Last week, Günter Hesse another PhD student of our EPIC group successfully defended his thesis on the topic of "A … > Zum Artikel

08.07.2022 | Successful PhD Defense by Markus Dreseler

Markus Dreseler has successfully defended his PhD thesis on Automatic Tiering for In-Memory Database Systems. > Zum Artikel

Literature

"A Course in In-Memory Data Management" by Prof. Dr. h.c. Hasso Plattner. This book is the culmination of six years work of in-memory research. As such, it provides the technical foundation for combined transactional and analytical workloads inside one single database as well as examples of new applications that are now possible given the availability of the new technology. The book is available at Springer.