Hasso-Plattner-Institut
Prof. Dr. h.c. Hasso Plattner
  
 

Martin Boissier

Research Assistant, PhD Candidate

Martin BoissierPhone: +49 (331) 5509 - 1330
Fax: +49 (331) 97992 - 579
E-Mail: martin.boissier(at)hpi.de
Website:https://martin.boissier.de
Room: Hasso-Plattner-Villa, V 2.05

Research

Main Memory Footprint Reduction of In-Memory Database Systems

Modern relational database systems keep data resident in main memory. While this enables high  runtime performance, the required computational resources also incur a high TCO. It is thus desirable to reduce the main memory footprint while at the same time retaining the performance superiority over disk-based systems. In the spectrum of a fully DRAM-resident to a disk-resident database system, the goal is to find a configuration with the maximum runtime performance for a given memory budget. Existing approaches focus either on analytical or transactional systems. However, for OLxP workloads, such a reduction is an unsolved challenge for which existing methods are insufficient.
We propose methods for OLxP database systems, which degrade gracefully with decreasing memory budgets and adapt dynamically to changing workloads. To estimate a configuration’s impact before actually applying it, we build learned performance estimators that allow us to generate robust configurations.
The actual footprint reduction process is divided into three aspects. First, we reduce existing allocations by removing inefficient secondary indices and applying workload-driven compression configurations. Second, we use hybrid table layouts that evict infrequently accessed data to secondary storage tiers. Third, we employ auxiliary data structures that eliminate most unnecessary accesses to secondary storage. This mitigates the negative effects of tiering data to slower storage tiers.
We show that access patterns often seen in real-world systems allow reducing the footprint significantly with neglectable performance losses. For very small memory budgets, the auxiliary data structures can avoid the majority of accesses to slower storage tiers.

Publications

  • Workload-Driven and Robus... - Download
    Boissier, M., Jendruk, M.: Workload-Driven and Robust Selection of Compression Schemes for Column Stores. 22nd International Conference on Extending Database Technology (EDBT). pp. 674-677 (2019).
     
  • Hyrise Re-engineered: An ... - Download
    Dreseler, M., Kossmann, J., Boissier, M., Klauck, S., Uflacker, M., Plattner, H.: Hyrise Re-engineered: An Extensible Database System for Research in Relational In-Memory Data Management. 22nd International Conference on Extending Database Technology (EDBT). pp. 313-324 (2019).
     
  • Automated Repricing and O... - Download
    Schlosser, R., Walther, C., Boissier, M., Uflacker, M.: Automated Repricing and Ordering Strategies in Competitive Markets. AI Communications. 32, 15-29 (2019).
     
  • Efficient Scalable Multi-... - Download
    Schlosser, R., Kossmann, J., Boissier, M.: Efficient Scalable Multi-Attribute Index Selection Using Recursive Strategies. IEEE 35th International Conference on Data Engineering (ICDE 2019). pp. 1238-1249. IEEE (2019).
     
  • Reducing the Footprint of... - Download
    Boissier, M.: Reducing the Footprint of Main Memory HTAP Systems: Removing, Compressing, Tiering, and Ignoring Data. PhD Workshop at VLDB. CEUR-WS.org (2018).
     
  • Dealing with the Dimensio... - Download
    Schlosser, R., Boissier, M.: Dealing with the Dimensionality Curse in Dynamic Pricing Competition: Using Frequent Repricing to Compensate Imperfect Market Anticipations. Computers and Operations Research. 100, 26-42 (2018).
     
  • Improving Box Office Resu... - Download
    Ruhrländer, R.P., Boissier, M., Uflacker, M.: Improving Box Office Result Predictions for Movies Using Consumer-Centric Models. KDD '18 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. pp. 655-664 (2018).
     
  • Dynamic Pricing under Com... - Download
    Schlosser, R., Boissier, M.: Dynamic Pricing under Competition on Online Marketplaces: A Data-Driven Approach. KDD '18 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. pp. 705-714 (2018).
     
  • Data-Driven Inventory Man... - Download
    Schlosser, R., Walther, C., Boissier, M., Uflacker, M.: Data-Driven Inventory Management and Dynamic Pricing Competition on Online Marketplaces. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI 2018). pp. 5856-5858 (2018).
     
  • Workload-Driven Horizonta... - Download
    Boissier, M., Kurzynski, D.: Workload-Driven Horizontal Partitioning and Partition Pruning for Large HTAP Systems. 2018 IEEE 34th International Conference on Data Engineering Workshops. pp. 116-121 (2018).
     
  • Hybrid Data Layouts for T... - Download
    Boissier, M., Schlosser, R., Uflacker, M.: Hybrid Data Layouts for Tiered HTAP Databases with Pareto-Optimal Data Placements. IEEE 34th International Conference on Data Engineering (ICDE 2018). pp. 209-220 (2018).
     
  • Optimal Repricing Strateg... - Download
    Schlosser, R., Boissier, M.: Optimal Repricing Strategies in a Stochastic Infinite Horizon Duopoly. Communications in Computer and Information Science (CCIS). pp. 129-150. Springer (2018).
     
  • Improving Materialization... - Download
    Boissier, M., Spivak, A., Meyer, C.: Improving Materialization for Tiered Column Stores: A Workload-Aware Ansatz Based on Table Reordering. ACSW '17 Proceedings of the Australasian Computer Science Week Multiconference, ACSW '17. pp. 25:1-25:10. ACM, New York, NY, USA (2017).
     
  • Detecting Fraudulent Adve... - Download
    Zimmermann, T., Djürken, T., Mayer, A., Janke, M., Boissier, M., Schwarz, C., Schlosser, R., Uflacker, M.: Detecting Fraudulent Advertisements on a Large E-Commerce Platform. Proceedings of the Nineteenth International Workshop on Data Warehousing and OLAP, DOLAP, Venice, Italy, March 21, 2017 (2017).
     
  • Data-Driven Repricing Str... - Download
    Boissier, M., Schlosser, R., Podlesny, N., Serth, S., Bornstein, M., Latt, J., Lindemann, J., Selke, J., Uflacker, M.: Data-Driven Repricing Strategies in Competitive Markets: An Interactive Simulation Platform. Proceedings of the Eleventh ACM Conference on Recommender Systems (RecSys '17). pp. 355-357. ACM, New York, NY, USA (2017).
     
  • An Interactive Platform t... - Download
    Serth, S., Podlesny, N., Bornstein, M., Lindemann, J., Latt, J., Selke, J., Schlosser, R., Boissier, M., Uflacker, M.: An Interactive Platform to Simulate Dynamic Pricing Competition on Online Marketplaces. 21st IEEE International Enterprise Distributed Object Computing Conference, EDOC 2017, Quebec City, QC, Canada, October 10-13, 2017. pp. 61-66. IEEE (2017).
     
  • Optimal Price Reaction St... - Download
    Schlosser, R., Boissier, M.: Optimal Price Reaction Strategies in the Presence of Active and Passive Competitors. Proceedings of the 6th International Conference on Operations Research and Enterprise Systems (ICORES), Porto, Portugal. pp. 47-56 (2017).
     
  • How To Survive Dynamic Pr... - Download
    Schlosser, R., Boissier, M., Schober, A., Uflacker, M.: How To Survive Dynamic Pricing Competition in E-commerce. Proceedings of the Poster Track of the 10th ACM Conference on Recommender Systems (RecSys 2016), Boston, USA, September 17, 2016 (2016).
     
  • Analyzing Data Relevance ... - Download
    Boissier, M., Meyer, C., Djürken, T., Lindemann, J., Mao, K., Reinhardt, P., Specht, T., Zimmermann, T., Uflacker, M.: Analyzing Data Relevance and Access Patterns of Live Production Database Systems. Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 2016. p. 2473--2475. ACM, New York, NY, USA (2016).
     
  • Footprint Reduction and U... - Download
    Faust, M., Boissier, M., Keller, M., Schwalb, D., Bischoff, H., Eisenreich, K., Färber, F., Plattner, H.: Footprint Reduction and Uniqueness Enforcement with Hash Indices in SAP HANA. Database and Expert Systems Applications: 27th International Conference, DEXA 2016, Porto, Portugal, September 5-8, 2016, Proceedings, Part II. p. 137--151 (2016).
     
  • A Cost-Aware and Workload... - Download
    Boissier, M., Djürken, T., Schlosser, R., Faust, M.: A Cost-Aware and Workload-Based Index Advisor for Columnar In-Memory Databases. 22nd International Conference, ICIST 2016, Druskininkai, Lithuania, October 13-15, 2016, Proceedings, CCIS 639. p. 285--299 (2016).
     
  • And all of a sudden: Main... - Download
    Boissier, M., Meyer, C., Uflacker, M., Tinnefeld, C.: And all of a sudden: Main Memory Is Less Expensive Than Disk. In: Rabl, T., Sachs, K., Poess, M., K. Baru, C., and Jacobsen, H.-A. (eds.) Big Data Benchmarking. pp. 132-144. Springer International Publishing (2015).
     
  • Dynamic and Transparent D... - Download
    Meyer, C., Boissier, M., Michaud, A., Vollmer, J.O., Taylor, K., Schwalb, D., Uflacker, M., Roedszus, K.: Dynamic and Transparent Data Tiering for In-Memory Databases in Mixed Workload Environments. International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures - ADMS @ VLDB 2015 (2015).
     
  • Optimizing Main Memory Ut... - Download
    Boissier, M.: Optimizing Main Memory Utilization of Columnar In-Memory Databases Using Data Eviction. Proceedings of Phd Workshop @ VLDB 2014, Hangzhou (2014).
     
  • An Integrated Data Manage... - Download
    Boissier, M., Krüger, J., Wust, J., Plattner, H.: An Integrated Data Management for Enterprise Systems. ICEIS 2014 - Proceedings of the 16th International Conference on Enterprise Information Systems. pp. 410-418 (2014).
     
  • Main Memory Databases for... - Download
    Krüger, J., Hübner, F., Wust, J., Boissier, M., Zeier, A., Plattner, H.: Main Memory Databases for Enterprise Applications. IEEE 18Th International Conference on Industrial Engineering and Engineering Management (IE&EM), 2011 (2011).
     
  • Data Structures for Mixed... - Download
    Krüger, J., Grund, M., Boissier, M., Zeier, A., Plattner, H.: Data Structures for Mixed Workloads in In-Memory Databases. 5th International Conference on Computer Sciences and Convergence Information Technology (ICCIT), 2010 (2010).