Hasso-Plattner-Institut
Prof. Dr. Tilmann Rabl
 

Martin Boissier

Ph.D. Student
Email: martin.boissier(at)hpi.de
Room: F-1.06
Profiles: DBLP - personal website

Research

Main Memory Footprint Reduction of In-Memory Database Systems

Database systems that keep their data primarily in main memory provide high query performance but also incur high costs. We have analyzed various real-world enterprise systems along with their workload and data characteristics. We found that the main memory footprint can be reduced efficiently by (i) data encoding and (ii) tiering, without significantly degrading performance.
To encode and compress a database instance, we use learned cost models to predict query runtimes for various data encodings. We then use linear programming models to determine optimal encoding configurations within a given memory budget. To make the approach applicable in real-world scenarios, the models incorporate robustness measures that mitigate unexpected performance degradations. To tier data efficiently, we extended the hybrid data layout of the first version of Hyrise and evict infrequently accessed columns to secondary storage in a row-major format.
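
The budget-constrained selection described above can be illustrated with a small sketch. It is not the model from our publications: the column names, encodings, memory sizes, and predicted runtimes are all hypothetical, and an exhaustive search stands in for the linear programming solver so that the example needs only the Python standard library. The problem it solves is the same in spirit: pick exactly one encoding per column so that the summed predicted runtime is minimal while the summed memory stays within the budget.

```python
from itertools import product

# Hypothetical candidates per column: encoding -> (memory in MB, predicted runtime in ms).
# In practice the runtimes would come from learned cost models; here they are made up.
CANDIDATES = {
    "order_id": {"unencoded": (40, 10.0), "dictionary": (25, 12.0), "lz4": (10, 30.0)},
    "status": {"unencoded": (40, 8.0), "dictionary": (5, 6.0), "run_length": (2, 9.0)},
    "comment": {"unencoded": (100, 20.0), "dictionary": (80, 22.0), "lz4": (30, 45.0)},
}


def select_encodings(candidates, budget_mb):
    """Return (runtime, memory, configuration) of the fastest configuration
    that fits into budget_mb, or None if no configuration fits.

    Exhaustive search over all combinations; only feasible for toy inputs.
    """
    columns = list(candidates)
    best = None
    # Iterating over a dict yields its keys, i.e. the encoding names per column.
    for choice in product(*(candidates[column] for column in columns)):
        memory = sum(candidates[c][e][0] for c, e in zip(columns, choice))
        runtime = sum(candidates[c][e][1] for c, e in zip(columns, choice))
        if memory > budget_mb:
            continue  # violates the memory budget
        if best is None or runtime < best[0]:
            best = (runtime, memory, dict(zip(columns, choice)))
    return best


if __name__ == "__main__":
    print(select_encodings(CANDIDATES, budget_mb=100))
```

With a budget of 100 MB, the sketch selects `lz4` for `order_id` and `dictionary` for the other two columns, using 95 MB at a predicted runtime of 58 ms. A real instance has far too many combinations for enumeration, which is why a solver-based formulation is used instead.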

 

Selected Publications

  • [1] Hurdelhey, B., Weisgut, M. and Boissier, M. Workload-Driven Data Placement for Tierless In-Memory Database Systems. Datenbanksysteme für Business, Technologie und Web, BTW (2023), 47–70.
  • [2] Richly, K., Schlosser, R. and Boissier, M. Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications. Proceedings of the VLDB Endowment 15 (13) (2022), 4079–4092.
  • [3] Boissier, M. Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems. Proceedings of the VLDB Endowment 15 (4) (2022), 780–793.
  • [4] Heinzl, L., Hurdelhey, B., Boissier, M., Perscheid, M. and Plattner, H. Evaluating Lightweight Integer Compression Algorithms in Column-Oriented In-Memory DBMS. International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures, ADMS@VLDB (2021).
  • [5] Kossmann, J., Boissier, M., Dubrawski, A., Heseding, F., Mandel, C., Pigorsch, U., Schneider, M., Schniese, T., Sobhani, M., Tsayun, P., Wille, K., Perscheid, M., Uflacker, M. and Plattner, H. A Cockpit for the Development and Evaluation of Autonomous Database Systems. 37th IEEE International Conference on Data Engineering, ICDE (2021), 2685–2688.
  • [6] Richly, K., Schlosser, R. and Boissier, M. Joint Index, Sorting, and Compression Optimization for Memory-Efficient Spatio-Temporal Data Management. 37th IEEE International Conference on Data Engineering, ICDE (2021), 1901–1906.
  • [7] Dreseler, M., Boissier, M., Rabl, T. and Uflacker, M. Quantifying TPC-H Choke Points and Their Optimizations. Proceedings of the VLDB Endowment 13 (8) (2020), 1206–1220.
  • [8] Boissier, M. and Jendruk, M. Workload-Driven and Robust Selection of Compression Schemes for Column Stores. 22nd International Conference on Extending Database Technology, EDBT (2019), 674–677.
  • [9] Schlosser, R., Kossmann, J. and Boissier, M. Efficient Scalable Multi-Attribute Index Selection Using Recursive Strategies. 35th IEEE International Conference on Data Engineering, ICDE (2019), 1238–1249.
  • [10] Dreseler, M., Kossmann, J., Boissier, M., Klauck, S., Uflacker, M. and Plattner, H. Hyrise Re-engineered: An Extensible Database System for Research in Relational In-Memory Data Management. 22nd International Conference on Extending Database Technology, EDBT (2019), 313–324.
  • [11] Ruhrländer, R.P., Boissier, M. and Uflacker, M. Improving Box Office Result Predictions for Movies Using Consumer-Centric Models. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD (2018), 655–664.
  • [12] Schlosser, R. and Boissier, M. Dynamic Pricing under Competition on Online Marketplaces: A Data-Driven Approach. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD (2018), 705–714.
  • [13] Boissier, M., Schlosser, R. and Uflacker, M. Hybrid Data Layouts for Tiered HTAP Databases with Pareto-Optimal Data Placements. 34th IEEE International Conference on Data Engineering, ICDE (2018), 209–220.

Teaching

Lectures and Seminars:

Supervised Master Theses:

  • "Workload-Driven Smooth Index and Filter Selection for In-Memory Database Scan Acceleration" (November 2022)
  • "Cost-aware Filtering in Query Processing on Serverless Cloud Infrastructure" (October 2022)
  • "Automatic Tiering in Hyrise" (September 2022)
  • "Automatic Clustering in Hyrise" (October 2020)
  • "Learned Cost Models for Query Optimization" (March 2019)
  • "Improving Cardinality Estimation and Access Avoidance in Hyrise" (November 2018)
  • "Data-Driven Ordering and Dynamic Pricing Competition on Online Marketplaces" (May 2018)
  • "Probabilistic Data Structures for In-Memory Databases" (May 2018)
  • "Maintainable and Self-Adapting Column Compression Schemes for HTAP Databases" (April 2018)
  • "Optimizing Database Scan Performance through Access Avoidance in Chunk-Based Databases using Multi-Dimensional Filters" (August 2017)
  • "Predicting movie success before release – Using individualized econometric models to predict box office performance." (January 2017)
  • "Workload-Aware Partitioning and Query Pruning for Mixed Workloads on In-Memory Databases" (January 2016)