Hasso-Plattner-Institut
  
Hasso-Plattner-Institut
Prof. Dr. h.c. Hasso Plattner
  
 

Projects Overview

Environmental Monitoring

Air quality is an important factor for human quality of life. As a result, many governments are creating guidelines concerning emissions. To achieve better air quality, governments, industry and infrastructure providers have to work together to implement effective pollution management programs

A core requirement for these programs is the effective assessment of the geographical distribution of emissions and their sources. This allows environmental experts to determine the best policies and most effective locations for emission mitigating infrastructure. Read more.

Contact: Günter Hesse, Markus DreselerDr. Matthias Uflacker

Research Area: In-Memory Data Management for Enterprise Systems


Dynamic Tour Planning

Sales representatives (reps) maximize sales profitability by selling goods and services. They visit retail stores within an assigned territory to fully exploit the sales potential for the products of their represented company. During their visits the sales reps record and optimize product placement, install advertising displays, check products for compliance, i.e. whether the store offers them to buy, as well as out-of stock situations, and talk to the stores’ managers to improve the product representation within the store. Regularly, sales reps have to schedule their store visits for the upcoming time frame with the goal to choose the “right” stores, which are supposed to increase the sales profitability as much as possible. Until now, this planning is done manually based on static and aggregated data provided by spreadsheets from the rep’s manager. Read more.

Contact: Martin FaustStefan Klauck, David SchwalbDr. Matthias Uflacker

Research Area: In-Memory Data Management for Enterprise Systems 


HPI Business Simulator

Today’s reporting offers unprecedented flexibility. Companies can dive into their data, filter for criteria, and drill down into hierarchies to explore their data live and on line item level. Companies wish to exploit this flexibility not only for reporting but also for forecasting and simulation. They want to define potential future scenarios and calculate how these influence their businesses. Exploiting SAP HANA and our Aggregate Cache technology, what-if analyses can be modeled and run efficiently by means of interactively defined simulation scenarios that are calculated on the fly. In that way, the analyses can not only support the monthly budgeting process but also day-to-day decision-making and simplified planning. Read more.

Contact: Stefan KlauckDr. Matthias Uflacker

Research Area: In-Memory Data Management for Enterprise Systems


HANA Load Simulator

Screenshot of the running HANA Load Simulator

The HANA Load Simulator creates a realistic enterprise workload of thousands of concurrent users and executes that workload on different database configurations simultaneously. A dashboard monitors several performance indicators of each database, incl. data footprint, transaction latencies, throughput, and overall CPU utilization. The dashboard can also be used to configure several workload parameters like OLTP and OLAP query frequencies or the ratio of actual and historical queries. This provides a simple and interactive tool to assess key performance characteristics of different database setups (e.g., single- vs. multi-node) side-by-side and in real-time. Read more.

Contact: Martin Boissier, Carsten MeyerDr. Matthias Uflacker

Research Area: In-Memory Data Management for Enterprise Systems


SORMAS

In October 2014, the Helmholtz Center for Infectious Diseases, Robert Koch Institute, Bernhard Nocht Institute, Nigeria Field Epidemiology and Laboratory Training Program (NFELTP), Hasso Plattner Institute, and SAP consolidated their efforts and expertise in an interdisciplinary committee to build the Surveillance Outbreak and Response Management System (SORMAS), a management tool to support identifying emerging infections and suspected cases as well as their contacts and leveraging immediate information exchange between all involved parties of outbreak control. In order to meet the specific technical requirements of West African countries, SORMAS consists of applications for both desktop PCs and Android smartphones that are connected to a central data management platform. Read more.

Contact: Cindy PerscheidDr. Matthieu-P. Schapranow, Dr. Matthias Uflacker

Research Area: In-Memory Data Management for Life Sciences


Dynamic Aggregates Caching

The mixed database workloads of enterprise applications are comprised of short-running transactional as well as analytical queries with resource-intensive data aggregations. In this context, caching the query results of long-running aggregate queries is desirable as it increases the overall performance. In-memory databases with a main-delta architecture are optimized for a new caching mechanism for aggregate queries which is the main contribution of this ongoing research project. With the separation into main and delta storage, cached aggregates do not have to be invalidated when new data is inserted to the delta storage. Instead, we can use the cached aggregate query result and combine it with the newly added records in the delta storage. Read more.

Contact: Stephan Müller

Research Area: In-Memory Data Management for Enterprise Systems


HYRISE

Traditional databases are separated into ones for current data from the day-to-day business processes and ones for reporting and analytics. For fast moving businesses, moving data from one silo to another is cumbersome and takes too much time. As a result, the new data arriving in the reporting system is already old by the time it is loaded. HYRISE proposes a new way to solve this problem: It analyzes the query input and reorganizes the stored data in different dimensions.  In detail, HYRISE partitions the layout of the underlying tables in a vertical and horizontal manner depending on the input to this layout management component. The workload is specified as a set of queries and weights and is processed by calculating the layout dependent costs for those queries. Based on our cost-model we can now calculate the best set of partitions for this input workload. This optimization allows great speed improvements compared to traditional storage models.

Contact: David Schwalb

Research Area: In-Memory Data Management for Enterprise Systems


High-Performance In-Memory Genome (HIG) Project

The continuous progress in understanding relevant genomic basics, e.g. for treatment of cancer patients, collides with the tremendous amount of data, that need to be processed. For example, the human genome consists of approx. 3.2 billion base pairs resp. 3.2 GB of data. Identifying a concrete sequence of 20 base pairs within the genome takes hours to days if performed manually. Processing and analyzing genomic data is a challenge for medical and biological research that delays progress of research projects. From a software engineering point of view, improving the analysis of genomic data is both a concrete research and engineering challenge. Combining knowledge of in-memory technology and of how to perform real-time analysis of huge amount of data with concrete research questions of medical and biological experts is the aim of the HIG project. Read more.

Contact: Dr. Matthieu Schapranow, Cindy Perscheid

Research Area: In-Memory Data Management for Life Sciences


Interactive Tactic-Board

In recent years the use of geo-spatial data increased strongly in various areas. Especially in the highly competitive sports sector new insights gained by positional information of players – tracked by camera or sensor based systems during a game – can have a major impact on the training and tactic of a team.  In contrast to current applications, which focus solely on the analysis and visualization of basic metrics like the run distance or the average position of a player, the interactive tactic board enables the analysis of complex tactical patterns. For coaches and video analysts the analysis of game recordings is an important step during the preparation and post-processing of games. They extract strength and weaknesses of their teams as well as opponents by manually analyzing the video recordings of past games. Since video recordings are an unstructured data source, it is a complex and time intensive task to find specific game situations or similar patterns in the recordings. Read more.

Contact: Keven RichlyDr. Matthias Uflacker


Natural Language Processing in In-Memory Databases

The current data deluge demands fast and real-time processing of large datasets to support various applications, also for textual data, such as scientific publications, Web pages or messages in the social media. Natural language processing (NLP) is the field of automatically processing textual documents and includes a variety of tasks such as tokenization (delimitation of words), part-of-speech tagging (assignment of syntactic categories to words), chunking (delimitation of phrases) and syntactic parsing (construction of syntactic tree for a sentence). Read more.

Contact: Dr. Mariana NevesDr. Matthias Uflacker


Project Archive

Find a list of previous projects here.