Prof. Dr. h.c. Hasso Plattner

Thomas Bodner

Research Assistant, PhD Candidate

Email: thomas.bodner(at)hpi.de
Phone: +49 (331) 5509 - 3934
Address: August-Bebel-Str. 88, 14482 Potsdam
Office: Campus II, V 2.05
Office Hours: Just stop by or mail/call ahead for an appointment
Profiles: DBLP, Google ScholarResearchGate, GitHubLinkedIn

Feel free to contact me for student theses and assistant positions.


Towards Cloud-based Enterprise Applications on Large Shared Datasets

Enterprises are moving their applications to public cloud environments to benefit from the resource elasticity and cost efficiency that their infrastructures provide. The resulting collocation of applications brings an opportunity to share application data within and across organizations for integrated analytics. Current database systems, however, do not exhibit either the in-memory performance for real-time analytics on data inside an organization, or the elasticity for efficient ad-hoc combination with large external data. In this work, we discuss a database architecture that exploits modern cloud infrastructure to combine both. Specifically, we make three contributions:

First, we present a main memory storage engine that is centered around a columnar, serialization-free, and interoperable data format. This storage engine enables efficient data exchange between application stacks by eliminating costly data transformations. Data can be accessed in-place over RDMA or as database checkpoints on remote shared cloud storage. The performance for both transactional and analytical data processing is kept up through auxiliary data structures, such as MVCC version chains, indexes, and filters.

Second, we propose a massively-parallel query engine that runs the relational operators as short-lived and stateless cloud functions against shared storage. The cloud functions are specified in C++ to utilize scarce per-function resources efficiently, e.g., via SIMD capabilities and tailored memory management. The query engine thereby embraces the fine-grained compute resource consumption model of current cloud platforms.

Third, we design a query plan optimizer that targets our execution engine. It takes into account the particularities of the cloud platform resources, i.e., strict time and space limits on compute units and multi-tiered storage with various price-performance points. It further respects the cloud provider's resource pricing to produce fast yet budget-conscious query plans.



  • Trends and Concepts in the Software Industry I (2021, 202020192018)


  • Trends and Concepts in the Software Industry II (2020)
  • Research and Implementation of Database Concepts (2020)
  • Develop your own Database (2019)

Bachelor's Projects:

Master's Projects:

  • Building an Elastic Query Engine on Serverless Cloud Infrastructure (2021)
  • Performance Engineering for Cloud-based Database Systems (2020)

Master's Theses:

  • Query Compilation for Distributed Execution with Cloud Functions (Julian Menzler, 2021)
  • Straggler Mitigation in Distributed Query Execution on Cloud Functions (Fabian Engel, 2021)
  • Elastic Query Execution via Short-lived and Stateless Cloud Functions (Jan Mensch, 2020)
  • Network Request Handling in Database Systems (Toni Stachewicz, 2019)
  • Data-dependent Implicit Authorizations for Fine-grained Database Access Control (Dennis Hempfing, 2018)
  • Pushing Down User-defined Functionality in Distributed Log-centric Big Data Stacks (Josephine Rückert, 2017)


  • Bodner, T.: Elastic Query Processing on Function as a Service Platforms.Proceedings of the VLDB PhD Workshop (2020).
  • Goel, A., Pound, J., Auch, N., Bumbulis, P., MacLean, S., Färber, F., Gropengiesser, F., Mathis, C., Bodner, T., Lehner, W.: Towards Scalable Real-time Analytics: An Architecture for Scale-out of OLXP Workloads.Proceedings of the VLDB Endowment. (2015).
  • Alexandrov, A., Schiefer, B., Poelman, J., Ewen, S., Bodner, T., Markl, V.: Myriad - Parallel Data Generation on Shared-nothing Architectures.Proceedings of the 1st Workshop on Architectures and Systems for Big Data. (2011).


Berg, G., Wenz, A., Hoeppner, B., Bodner, T., Cherepanova, O., Steffen, L., Siebert, J., Hennemann, D., Schulze, P., Dobler, K., Kahl, K., Beneke, P., Hoberg, P.:
Generation of Bots Based on Observed Behavior.
US 2020.

Bumbulis, P., Pound, J., Auch, N., Goel, A., Ringwald, M., Bodner, T., MacLean, S.:
An Algorithm for Consistent Replication of Log-Structured Data.
EU 2017 and US 2017.