1.
Menon, P., Qadah, T.M., Rabl, T., Sadoghi, M., Jacobsen, H.-A.: LogStore: A Workload-aware, Adaptable Key-Value Store on Hybrid Storage Systems. IEEE Transactions on Knowledge and Data Engineering (2020).
2.
Silva, P., Wang, Y., Rabl, T.: Grand Challenge: Incremental Stream Query Analytics. Proceedings of the 14th ACM International Conference on Distributed and Event-based Systems (DEBS ’20). p. 6 (2020).
Applications in the Internet of Things (IoT) create many data processing challenges because they have to deal with massive amounts of data under low-latency constraints. The DEBS Grand Challenge 2020 specifies an IoT problem whose objective is to identify a special type of event in a stream of electricity smart meter data. In this work, we present the Sequential Incremental DBSCAN-based Event Detection Algorithm (SINBAD), a solution based on an incremental version of the clustering algorithm DBSCAN and scenario-specific data processing optimizations. SINBAD computes solutions up to 7 times faster and up to 26% more accurately than the baseline provided by the DEBS Grand Challenge.
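For illustration, a minimal Python sketch of the incremental-clustering idea behind SINBAD follows. The class name, parameters, and the simplified merge and noise handling are ours, not the paper's; a real implementation would use a spatial index and full core/border bookkeeping.

import math

class IncrementalDBSCAN:
    def __init__(self, eps=0.5, min_pts=3):
        self.eps = eps
        self.min_pts = min_pts
        self.points = []          # all points seen so far
        self.labels = []          # cluster id per point, -1 = noise
        self.next_cluster = 0

    def _neighbors(self, p):
        # Linear scan for simplicity; a spatial index would be used in practice.
        return [i for i, q in enumerate(self.points)
                if math.dist(p, q) <= self.eps]

    def insert(self, p):
        nbrs = self._neighbors(p)
        if len(nbrs) + 1 < self.min_pts:
            self.points.append(p)
            self.labels.append(-1)          # too sparse: noise for now
            return
        # Cluster ids among neighbors; merge them if the new core point bridges them.
        ids = {self.labels[i] for i in nbrs if self.labels[i] != -1}
        if not ids:
            cid = self.next_cluster
            self.next_cluster += 1
        else:
            cid = min(ids)
            for i, lbl in enumerate(self.labels):
                if lbl in ids:
                    self.labels[i] = cid    # merge bridged clusters
        self.points.append(p)
        self.labels.append(cid)
        for i in nbrs:
            if self.labels[i] == -1:
                self.labels[i] = cid        # noise absorbed by new core point

stream = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0)]
clusterer = IncrementalDBSCAN(eps=0.3, min_pts=3)
for point in stream:
    clusterer.insert(point)
print(clusterer.labels)   # first three points form a cluster, last one is noise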
3.
Dreseler, M., Boissier, M., Rabl, T., Uflacker, M.: Quantifying TPC-H Choke Points and Their Optimizations [Experiments and Analyses]. Proceedings of the VLDB Endowment. pp. 1206–1220 (2020).
4.
Derakhshan, B., Mahdiraji, A.R., Abedjan, Z., Rabl, T., Markl, V.: Optimizing Machine Learning Workloads in Collaborative Environments. ACM SIGMOD/PODS International Conference on Management of Data, Portland, OR, USA (2020).
Effective collaboration among data scientists results in high-quality and efficient machine learning (ML) workloads. In a collaborative environment, such as Kaggle or Google Colaboratory, users typically re-execute or modify published scripts to recreate or improve the result. This introduces many redundant data processing and model training operations. Reusing the data generated by these redundant operations leads to more efficient execution of future workloads. However, existing collaborative environments lack a data management component for storing and reusing the results of previously executed operations. In this paper, we present a system to optimize the execution of ML workloads in collaborative environments by reusing previously performed operations and their results. We utilize a so-called Experiment Graph (EG) to store the artifacts, i.e., raw and intermediate data or ML models, as vertices and the operations of ML workloads as edges. In theory, the size of EG can become unnecessarily large, while the storage budget might be limited. At the same time, for some artifacts, the overall storage and retrieval cost might outweigh the recomputation cost. To address this issue, we propose two algorithms for materializing artifacts based on their likelihood of future reuse. Given the materialized artifacts inside EG, we devise a linear-time reuse algorithm to find the optimal execution plan for incoming ML workloads. Our reuse algorithm incurs only a negligible overhead and scales to the large number of incoming ML workloads in collaborative environments. Our experiments show that we improve the run-time by one order of magnitude for repeated execution of workloads and by 50% for the execution of modified workloads in collaborative environments.
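A toy sketch of the reuse idea follows, assuming a dictionary keyed by (input artifact, operation) stands in for the Experiment Graph; the materialization-budget algorithms and the linear-time plan search from the paper are omitted, and all names are illustrative.

class ExperimentGraph:
    def __init__(self):
        self.materialized = {}   # (input_artifact_id, op_name) -> artifact

    def execute(self, input_id, input_data, op_name, op_fn):
        key = (input_id, op_name)
        if key in self.materialized:
            return self.materialized[key]   # reuse, skip recomputation
        result = op_fn(input_data)          # cache miss: run the operation
        self.materialized[key] = result     # materialize (budget checks omitted)
        return result

eg = ExperimentGraph()
raw = [3, 1, 2]
a = eg.execute("raw", raw, "sorted", sorted)   # computed
b = eg.execute("raw", raw, "sorted", sorted)   # reused from the graph
assert a is b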
5.
Grulich, P.M., Breß, S., Zeuch, S., Traub, J., von Bleichert, J., Chen, Z., Rabl, T., Markl, V.: Grizzly: Efficient Stream Processing Through Adaptive Query Compilation. ACM SIGMOD/PODS International Conference on Management of Data, Portland, OR, USA (2020).
Stream Processing Engines (SPEs) execute long-running queries on unbounded data streams. They rely on managed runtimes and an interpretation-based processing model, and they do not perform runtime optimizations. Recent research states that this limits the utilization of modern hardware and neglects changing data characteristics at runtime. In this paper, we present Grizzly, a novel adaptive, query-compilation-based SPE that enables highly efficient query execution on modern hardware. We extend query compilation and task-based parallelization for the unique requirements of stream processing and apply adaptive compilation to enable runtime re-optimizations. The combination of lightweight statistics gathering and just-in-time compilation enables Grizzly to adjust dynamically to changing data characteristics at runtime. Our experiments show that Grizzly achieves up to an order of magnitude higher throughput and lower latency compared to state-of-the-art interpretation-based SPEs.
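A hedged, Python-only illustration of the adaptive-compilation idea: predicates are fused into one generated function and recompiled in a better order when sampled selectivities drift. Grizzly itself generates native code and gathers statistics far more cheaply; every name below is invented.

def compile_pipeline(preds):
    # "Compile" by fusing all predicates into a single generated function.
    body = " and ".join(f"preds[{i}](x)" for i in range(len(preds)))
    scope = {"preds": preds}
    exec(f"def pipeline(batch):\n    return [x for x in batch if {body}]", scope)
    return scope["pipeline"]

def run_adaptive(batches, preds):
    passes = [0] * len(preds)
    pipeline = compile_pipeline(preds)
    for batch in batches:
        for x in batch[:8]:                      # lightweight per-batch sampling
            for i, p in enumerate(preds):
                passes[i] += bool(p(x))
        yield pipeline(batch)
        order = sorted(range(len(preds)), key=lambda i: passes[i])
        if order != list(range(len(preds))):     # selectivity drift observed
            preds = [preds[i] for i in order]    # most selective predicate first
            passes = [passes[i] for i in order]
            pipeline = compile_pipeline(preds)   # just-in-time recompilation

coarse_first = [lambda x: x < 90, lambda x: x % 7 == 0]
for out in run_adaptive([list(range(100))] * 3, coarse_first):
    print(len(out))   # same results; later batches run the reordered pipeline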
6.
Lutz, C., Breß, S., Zeuch, S., Rabl, T., Markl, V.: Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects. ACM SIGMOD/PODS International Conference on Management of Data, Portland, OR, USA (2020).
GPUs have long been discussed as accelerators for database query processing because of their high processing power and memory bandwidth. However, two main challenges limit the utility of GPUs for large-scale data processing: (1) the on-board memory capacity is too small to store large data sets, and (2) the interconnect bandwidth to CPU main memory is insufficient for ad-hoc data transfers. As a result, GPU-based systems and algorithms run into a transfer bottleneck and do not scale to large data sets. In practice, CPUs process large-scale data faster than GPUs with current technology. In this paper, we investigate how a fast interconnect can resolve these scalability limitations using the example of NVLink 2.0, a new interconnect technology that links dedicated GPUs to a CPU. The high bandwidth of NVLink 2.0 enables us to overcome the transfer bottleneck and to efficiently process large data sets stored in main memory on GPUs. We perform an in-depth analysis of NVLink 2.0 and show how we can scale a no-partitioning hash join beyond the limits of GPU memory. Our evaluation shows speedups of up to 18× over PCIe 3.0 and up to 7.3× over an optimized CPU implementation. Fast GPU interconnects thus enable GPUs to efficiently accelerate query processing.
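The out-of-core pattern behind such a join can be sketched schematically: the build side's hash table stays resident in (simulated) device memory while the larger probe side streams over the interconnect in chunks. This CPU-only Python sketch shows only the chunking logic; the chunk size and names are illustrative.

def hash_join_streaming(build, probe, chunk_tuples=4):
    # Build phase: one global hash table, as in a no-partitioning join.
    table = {}
    for key, payload in build:
        table.setdefault(key, []).append(payload)
    # Probe phase: stream the probe relation chunk by chunk, so data sets
    # larger than device memory can still be processed.
    for start in range(0, len(probe), chunk_tuples):
        chunk = probe[start:start + chunk_tuples]   # stands in for a DMA transfer
        for key, payload in chunk:
            for match in table.get(key, ()):
                yield (key, match, payload)

build = [(1, "a"), (2, "b")]
probe = [(1, "x"), (3, "y"), (2, "z"), (1, "w"), (2, "v")]
print(list(hash_join_streaming(build, probe)))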
7.
Del Monte, B., Zeuch, S., Rabl, T., Markl, V.: Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines. ACM SIGMOD/PODS International Conference on Management of Data, Portland, OR, USA (2020).
Scale-out stream processing engines (SPEs) power large big data applications on high-velocity data streams. Industrial setups require SPEs to sustain outages, varying data rates, and low-latency processing, and SPEs need to transparently reconfigure stateful queries during runtime. However, state-of-the-art SPEs are not yet ready to handle on-the-fly reconfigurations of queries with terabytes of state due to three problems: network overhead for state migration, consistency, and overhead on data processing. In this paper, we propose Rhino, a library for efficient reconfigurations of running queries in the presence of very large distributed state. Rhino provides a handover protocol and a state migration protocol to consistently and efficiently migrate stream processing among servers. Overall, our evaluation shows that Rhino scales to state sizes of up to terabytes, reconfigures a running query 15 times faster than the state of the art, and reduces latency by three orders of magnitude upon a reconfiguration.
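A control-flow skeleton of a handover-style migration, loosely inspired by the paper's description: state moves incrementally while the old instance keeps running, and processing switches over once the remaining delta is small. All class and method names are invented, and the consistency machinery is omitted.

class Operator:
    def __init__(self):
        self.state = {}       # keyed operator state (e.g., window contents)
        self.offset = 0       # position in the input stream
        self.running = True

    def pause(self):
        self.running = False

    def resume_from(self, offset):
        self.offset = offset
        self.running = True

def handover(old, new, batch_size=1000):
    # Phase 1: copy state in batches while `old` continues processing.
    migrated = set()
    while len(old.state) - len(migrated) > batch_size:
        for k in [k for k in old.state if k not in migrated][:batch_size]:
            new.state[k] = old.state[k]
            migrated.add(k)
    # Phase 2: short pause, ship the remaining (and any updated) entries,
    # then the new instance takes over at the old one's stream position.
    old.pause()
    new.state.update(old.state)
    new.resume_from(old.offset)

old, new = Operator(), Operator()
old.state = {i: i * i for i in range(2500)}
old.offset = 42
handover(old, new)
assert new.state == old.state and new.offset == 42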
8.
Kaitoua, A., Rabl, T., Markl, V.: A Distributed Data Exchange Engine for Polystores. it - Information Technology (2020).
There is increasing interest in fusing data from heterogeneous sources. Combining data sources increases the utility of existing datasets, generates new information, and creates services of higher quality. A central issue in working with heterogeneous sources is data migration: in order to share and process data in different engines, resource-intensive and complex movements and transformations between computing engines, services, and stores are necessary. Muses is a distributed, high-performance data migration engine that is able to interconnect distributed data stores by forwarding, transforming, repartitioning, or broadcasting data among distributed engines’ instances in a resource-, cost-, and performance-adaptive manner. As such, it performs seamless information sharing across all participating resources in a standard, modular manner. We show an overall improvement of 30% for pipelining jobs across multiple engines, even when we count the overhead of Muses in the execution time. This performance gain implies that Muses can be used to optimize large pipelines that leverage multiple engines.
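One exchange pattern Muses supports, hash repartitioning between differently sized sets of engine instances, can be sketched as follows; the function and its signature are invented for illustration.

def repartition(source_partitions, num_targets, key_fn=hash):
    # Route each record from its source partition to a target instance
    # chosen by hashing, decoupling upstream and downstream parallelism.
    targets = [[] for _ in range(num_targets)]
    for partition in source_partitions:
        for record in partition:
            targets[key_fn(record) % num_targets].append(record)
    return targets

sources = [[1, 2, 3], [4, 5, 6]]                      # two upstream instances
print(repartition(sources, 3, key_fn=lambda r: r))    # three downstream instances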
9.
Benson, L., Grulich, P.M., Zeuch, S., Markl, V., Rabl, T.: Disco: Efficient Distributed Window Aggregation. Proceedings of the 23rd International Conference on Extending Database Technology (EDBT). OpenProceedings.org (2020).
Many business applications benefit from fast analysis of online data streams. Modern stream processing engines (SPEs) provide complex window types and user-defined aggregation functions to analyze streams. While SPEs run in central data centers, wireless sensor networks (WSNs) perform distributed aggregations close to the data sources, which is especially beneficial in modern IoT setups. However, WSNs support only basic aggregations and windows. To bridge the gap between complex central aggregations and simple distributed analysis, we propose Disco, a distributed complex window aggregation approach. Disco processes complex window types on multiple independent nodes while efficiently aggregating incoming data streams. Our evaluation shows that Disco's throughput scales linearly with the number of nodes and that Disco already outperforms a centralized solution in a two-node setup. Furthermore, Disco significantly reduces network cost compared to the centralized approach. Disco's tree-like topology handles thousands of nodes per level and scales to support future data-intensive streaming applications.
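The core saving can be sketched in a few lines: child nodes pre-aggregate their local streams per window, and parents merge partial aggregates instead of raw tuples. The sketch below assumes tumbling windows and sums only; Disco supports far richer window types and aggregation functions, and all names are illustrative.

from collections import defaultdict

def local_partials(events, window_size):
    # events: (timestamp, value) pairs observed at one sensor node
    partials = defaultdict(int)
    for ts, v in events:
        partials[ts // window_size] += v    # per-window partial sum
    return partials

def merge(*node_partials):
    # Executed at a parent in the tree: combine children's partials;
    # only one small value per window crosses the network per child.
    merged = defaultdict(int)
    for partials in node_partials:
        for window, total in partials.items():
            merged[window] += total
    return merged

node_a = local_partials([(1, 5), (2, 3), (11, 7)], window_size=10)
node_b = local_partials([(4, 1), (12, 2)], window_size=10)
print(dict(merge(node_a, node_b)))   # {0: 9, 1: 9}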
10.
Makait, H.: Rethinking Message Brokers on RDMA and NVM. ACM SIGMOD/PODS International Conference on Management of Data, Portland, OR, USA (2020).
11.
Karimov, J., Rabl, T., Markl, V.: AJoin: Ad-hoc Stream Joins at Scale. Proceedings of the VLDB Endowment (2020).
The processing model of state-of-the-art stream processing engines is designed to execute long-running queries one at a time. However, with the advance of cloud technologies and multi-tenant systems, multiple users share the same cloud for stream query processing, which results in many ad-hoc stream queries sharing common stream sources. Many of these queries include joins. Two main limitations hinder ad-hoc stream join processing: missed optimization potential in both the stream data processing and query optimization layers, and the lack of dynamicity in query execution plans. We present AJoin, a dynamic and incremental ad-hoc stream join framework. AJoin consists of an optimization layer and a stream data processing layer. The optimization layer periodically reoptimizes the query execution plan, performing join reordering and vertical and horizontal scaling at run-time without stopping the execution. The data processing layer implements a pipeline-parallel join architecture. This layer enables incremental and consistent query processing, supporting all actions triggered by the optimizer. We implement AJoin on top of Apache Flink, an open-source data processing framework. AJoin outperforms Flink not only on ad-hoc multi-query workloads but also on single-query workloads.
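The sharing opportunity AJoin exploits can be illustrated with a toy batch example: the common equi-join on the shared sources is evaluated once, and each ad-hoc query applies only its residual predicate to the shared matches. AJoin's reoptimization and scaling logic is far more involved; the names below are invented.

def shared_join(left, right, queries):
    # Build once for the equi-join key shared by all registered queries.
    index = {}
    for l_key, l_val in left:
        index.setdefault(l_key, []).append(l_val)
    results = {q_id: [] for q_id, _ in queries}
    for r_key, r_val in right:
        for l_val in index.get(r_key, ()):
            # One join match, fanned out to every registered query.
            for q_id, predicate in queries:
                if predicate(l_val, r_val):
                    results[q_id].append((r_key, l_val, r_val))
    return results

left = [(1, 10), (2, 20)]
right = [(1, 100), (2, 5)]
queries = [("q1", lambda l, r: r > 50), ("q2", lambda l, r: True)]
print(shared_join(left, right, queries))
# q1 sees only (1, 10, 100); q2 sees both matches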
The processing model of state-of-the-art stream processing engines is designed to execute long-running queries one at a time. However, with the advance of cloud technologies and multi-tenant systems, multiple users share the same cloud for stream query processing. This results in many ad-hoc stream queries sharing common stream sources. Many of these queries include joins. There are two main limitations that hinder performing ad-hoc stream join processing. The first limitation is missed optimization potential both in stream data processing and query optimization layers. The second limitation is the lack of dynamicity in query execution plans. We present AJoin, a dynamic and incremental ad-hoc stream join framework. AJoin consists of an optimization layer and a stream data processing layer. The optimization layer periodically reoptimizes the query execution plan, performing join reordering and vertical and horizontal scaling at run-time without stopping the execution. The data processing layer implements pipeline-parallel join architecture. This layer enables incremental and consistent query processing supporting all the actions triggered by the optimizer. We implement AJoin on top of Apache Flink, an open-source data processing framework. AJoin outperforms Flink not only at ad-hoc multi-query workloads but also at single-query workloads.