Hasso-Plattner-Institut
Prof. Dr. h.c. mult. Hasso Plattner
 

Skyrise: A Serverless Query Processor

General Information

Enterprises increasingly run the applications supporting their business processes in the cloud. Application data residing in the cloud expand the importance of cloud-based analytical workloads, which require provisioned infrastructure before any query processing can begin. Resource provisioning can be difficult for these workloads because they are often unpredictable and ad-hoc in nature. Overprovisioning and reduced cost-efficiency are the norm to avoid disruption of performance due to insufficient resources.

Recently, cloud providers have introduced means to allocate and bill fine-granular units of resources with function-as-a-service (FaaS) compute platforms and shared object storage systems. We evaluate this so-called serverless infrastructure regarding its performance elasticity and variability. Based on our findings, we build the Skyrise serverless query processor that interhits the elastic scalability of its underlying FaaS infrastructure while it deals with the limitations and inefficiencies. Skyrise enables cost-efficient, interactive analytics on infrequently accessed data, a workload for which conventionally provisioned database systems are idle most of the time and thus not viable.

The project team consists of Thomas Bodner, David JustenDr. Michael Perscheid and Prof. Dr. h.c. mult. Hasso Plattner. We thank all student contributors, without whom this project would not be where it is:
Lars Jonas Bollmeier, Fabian Engel, Jakob Köhler, Tobias Maltenberger, Jan Mensch, Julian Menzler, Timon Millich, Tobias Pietz, Theo Radig, Niklas Riekenbrauck, Benedikt Schenkel, Yannik Schröder, Pascal Schulze, Jan Siebert and Ben Wegener.
 

Publications

  • 1.
    Justen, D.: Cost-efficiency and Performance Robustness in Serverless Data Exchange. Proceedings of the 2022 International Conference on Management of Data (SIGMOD ’22), Student Research Competition. pp. 2506–2508. Association for Computing Machinery, New York, NY, USA (2022).
     
  • 2.
    Bodner, T., Pietz, T., Bollmeier, L.J., Ritter, D.: Doppler: Understanding Serverless Query Execution. Proceedings of the SIGMOD Workshop on Big Data in Emergent Distributed Environments. (2022).
     
  • 3.
    Bodner, T.: Elastic Query Processing on Function as a Service Platforms. Proceedings of the VLDB PhD Workshop. (2020).