1.
Mahling, F., Rößler, P., Bodner, T., Rabl, T.: BabelMR: A Polyglot Framework for Serverless MapReduce. Workshop on Serverless Data Analytics. (2023).
The MapReduce programming model and its open-source implementation Hadoop have democratized large-scale data processing by providing ease-of-use and scalability. Subsequently, systems such as Spark have dramatically improved efficiency. However, for a large number of users and applications, using these frameworks remains challenging, because they typically restrict users to specific programming languages or require cluster management expertise. In this paper, we present BabelMR, a data processing framework that provides the MapReduce programming model to arbitrary containerized applications executed on serverless cloud infrastructure. Users provide application logic in Map and Reduce functions that read and write their inputs and outputs to the ephemeral filesystem of a serverless function container. BabelMR orchestrates the data-parallel programs across stages of concurrent cloud function executions and efficiently integrates with serverless storage systems and columnar storage formats. Our evaluation shows that BabelMR lowers the entry hurdle to analyzing data in a distributed serverless environment in terms of development effort. BabelMR’s I/O and data shuffle building blocks outperform handwritten Python and C# code, and BabelMR is competitive with state-of-the-art serverless MapReduce systems.
2.
Benson, L., Ebeling, R., Rabl, T.: Evaluating SIMD Compiler-Intrinsics for Database Systems. 14th International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processors and Storage Architectures. (2023).
Modern query engines often use SIMD instructions to speed up query performance. As these instructions are heavily CPU-specific, developers must write multiple variants of the same code to support multiple target platforms such as AVX2, AVX512, and ARM NEON. This process leads to logical code duplication, which is cumbersome, hard to test, and hard to benchmark. In this paper, we make the case for writing less platform-specific SIMD code by leveraging the compiler’s own platform-independent SIMD vector abstraction. This allows developers to write a single code variant for all platforms as with a SIMD library, without the library’s redundant layers of abstraction. Clang and GCC implement the platforms’ SIMD intrinsics on top of their own abstraction, so code written in it is optimized for the underlying vector instructions by the compiler. We conduct four database operation microbenchmarks based on code in real systems on x86 and ARM and show that compiler-intrinsic variants achieve the same or even better performance than platform-intrinsics in most cases. In addition, we completely replace the SIMD library in the state-of-the-art query engine Velox with compiler-intrinsics. Our results show that query engines can achieve the same performance with platform-independent code while requiring significantly less SIMD code and fewer variants.
3.
Brücke, C., Härtling, P., Escobar Palacios, R.D., Patel, H., Rabl, T.: TPCx-AI - An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems. Proceedings of the VLDB Endowment. 16, 3649–3661 (2023).
Artificial intelligence (AI) and machine learning (ML) techniques have existed for years, but new hardware trends and advances in model training and inference have radically improved their performance. With an ever-increasing amount of algorithms, systems, and hardware solutions, it is challenging to identify good deployments even for experts. Researchers and industry experts have observed this challenge and have created several benchmark suites for AI and ML applications and systems. While they are helpful in comparing several aspects of AI applications, none of the existing benchmarks measures end-to-end performance of ML deployments. Many have been rigorously developed in collaboration between academia and industry, but no existing benchmark is standardized. In this paper, we introduce the TPC Express Benchmark for Artificial Intelligence (TPCx-AI), the first industry standard benchmark for end-to-end machine learning deployments. TPCx-AI is the first AI benchmark that represents the pipelines typically found in common ML and AI workloads. TPCx-AI provides a full software kit, which includes data generator, driver, and two full workload implementations, one based on Python libraries and one based on Apache Spark. We describe the complete benchmark and show benchmark results for various scale factors. TPCx-AI’s core contributions are a novel unified data set covering structured and unstructured data; a fully scalable data generator that can generate realistic data from GB up to PB scale; and a diverse and representative workload using different data types and algorithms, covering a wide range of aspects of real ML workloads such as data integration, data processing, training, and inference.
4.
Böther, M., Benson, L., Klimovic, A., Rabl, T.: Analyzing Vectorized Hash Tables Across CPU Architectures. Proceedings of the VLDB Endowment. 16, 2755–2768 (2023).
Data processing systems often leverage vector instructions to achieve higher performance. When applying vector instructions, an often overlooked data structure is the hash table, even though it is fundamental in data processing systems for operations such as indexing, aggregating, and joining. In this paper, we characterize and evaluate three fundamental vectorized hashing schemes: vectorized linear probing (VLP), vectorized fingerprinting (VFP), and bucket-based comparison (BBC). We implement these hashing schemes on the x86, ARM, and Power CPU architectures, as modern database systems must provide efficient implementations for multiple platforms due to the continuously increasing hardware heterogeneity. We present various implementation variants and platform-specific optimizations, which we evaluate for integer keys, string keys, large payloads, skewed distributions, and multiple threads. Our extensive evaluation and comparison to three scalar hashing schemes on four servers shows that BBC outperforms scalar linear probing by a factor of more than 2x, while also scaling well to high load factors. We find that vectorized hashing schemes come with caveats that need to be considered, such as the increased engineering overhead, differences between CPUs, and differences between vector ISAs, such as AVX and AVX-512, which impact performance. We conclude with key findings for vectorized hashing scheme implementations.
5.
Yue, W., Benson, L., Rabl, T.: Desis: Efficient Window Aggregation in Decentralized Networks. 26th International Conference on Extending Database Technology (EDBT ’23). (2023).
Stream processing is widely applied in industry as well as in research to process unbounded data streams. In many use cases, specific data streams are processed by multiple continuous queries. Current systems group events of an unbounded data stream into bounded windows to produce results of individual queries in a timely fashion. For multiple concurrent queries, multiple concurrent and usually overlapping windows are generated. To reduce redundant computations and share partial results, state-of-the-art solutions divide windows into slices and then share the results of those slices. However, this is only applicable for queries with the same aggregation function and window measure, as in the case of overlaps for sliding windows. For multiple queries on the same stream with different aggregation functions and window measures, partial results cannot be shared. Furthermore, data streams are produced from devices that are distributed in large decentralized networks. Current systems cannot process queries on decentralized data streams efficiently. All queries in a decentralized network are either computed centrally or processed individually without exploiting partial results across queries. We present Desis, a stream processing system that can efficiently process multiple stream aggregation queries. We propose an aggregation engine that can share partial results between multiple queries with different window types, measures, and aggregation functions. In decentralized networks, Desis moves computation to data sources and shares overlapping computation as early as possible between queries. Desis outperforms existing solutions by orders of magnitude in throughput when processing multiple queries and can scale to millions of queries. In a decentralized setup, Desis can save up to 99% of network traffic and scale performance linearly.
6.
Strassenburg, N., Kupfer, D., Kowal, J., Rabl, T.: Efficient Multi-Model Management. 26th International Conference on Extending Database Technology (EDBT ’23). (2023).
Deep Learning models are deployed in an increasing number of industrial domains, such as retail and automotive applications. An instance of a model typically performs one specific task, which is why larger software systems use multiple models in parallel. Given that all models in production software have to be managed, this leads to the problem of managing sets of related models, i.e., multi-model management. Existing approaches perform poorly on this task because they are optimized for saving single large models but not for simultaneously saving a set of related models. In this paper, we explore the space of multi-model management by presenting three optimized approaches: (1) A baseline approach that saves full model representations and minimizes the amount of saved metadata. (2) An update approach that reduces the storage consumption compared to the baseline by saving parameter updates instead of full models. (3) A provenance approach that saves model provenance data instead of model parameters. We evaluate the approaches for the multi-model management use cases of managing car battery cell models and image classification models. Our results show that the baseline outperforms existing approaches for save and recover times by more than an order of magnitude and that more sophisticated approaches reduce the storage consumption by up to 99%.
7.
Ilic, I., Tolovski, I., Rabl, T.: RMG Sort: Radix-Partitioning-Based Multi-GPU Sorting. In: König-Ries, B., et al. (eds.) Datenbanksysteme für Business, Technologie und Web (BTW 2023). (2023).
In recent years, graphics processing units (GPUs) have emerged as database accelerators due to their massive parallelism and high-bandwidth memory. Sorting is a core database operation with many applications, such as output ordering, index creation, grouping, and sort-merge joins. Many single-GPU sorting algorithms have been shown to outperform highly parallel CPU algorithms. Today’s systems include multiple GPUs with direct high-bandwidth peer-to-peer (P2P) interconnects. However, previous multi-GPU sorting algorithms do not efficiently harness the P2P transfer capability of modern interconnects, such as NVLink and NVSwitch. In this paper, we propose RMG sort, a novel radix partitioning-based multi-GPU sorting algorithm. We present a most-significant-bit partitioning strategy that efficiently utilizes high-speed P2P interconnects while reducing inter-GPU communication. Independent of the number of GPUs, we exchange radix partitions between the GPUs in one all-to-all P2P key swap and achieve nearly-perfect load balancing. We evaluate RMG sort on two modern multi-GPU systems. Our experiments show that RMG sort scales well with the input size and the number of GPUs, outperforming a parallel CPU-based sort by up to 20×. Compared to two state-of-the-art, merge-based, multi-GPU sorting algorithms, we achieve speedups of up to 1.3× and 1.8× across both systems. Excluding the CPU-GPU data transfer times and on eight GPUs, RMG sort outperforms the two merge-based multi-GPU sorting algorithms by up to 2.7× and 9.2×.