Hasso-Plattner-Institut
Prof. Dr. Tobias Friedrich
 

Lossy Compression of Time Series Data

Bachelor Project - Winter 2018/19 and Summer 2019

Photo credit: HPI/K. Herschelmann

The bachelor project is a cornerstone for the praxis-related study at the Hasso Plattner Institute. Central to this project is a group of students cooperating in developing a solution for a software-project given by an industrial partner. For the bachelor project 2018/19, we work with the Industrial Analytics IA GmbH.

 

Problem and Motivation

Time series data is data derived from consecutive measurements over time. It appears in all domains of applied science and engineering. Algorithms for processing such data face the additional challenge of exploiting the linear structure of the data rather than treating the time component as a mere additional data value.

Storing time series data can quickly overload any available storage. The random noise introduced by physical measurements fundamentally limits the achievable compression rate. For an effective compression of high-frequency data, we are therefore forced to apply lossy compression schemes.

We want to evaluate existing algorithms and develop new algorithms for lossy compression schemes for high-frequency time series data from different domains. The key question is what compression can be achieved without losing too much information while still being able to analyze the data and using it for modeling and statistical predictions. The scientific method will be two-fold:

  1.  an empirical study of different methods on real-world data;
  2.  a thorough analysis of the theoretical limitations;

Our project partner Industrial Analytics IA GmbH is a young IoT company with a scientific background in physics and mechanical engineering. They supply the team with high-dimensional industrial data sets spanning a period of several years. The data has been obtained from physical measurements of large industrial machines. For understanding the time series data, the partner company provides details of the particular industrial machines and the underlying thermodynamics. During the project, they will support the team with the required background in mechanical engineering and industrial data analytics.

Results

The work of this bachelor project was presented at the Bachelorpodium 2019. Some of the results have been published at the 27th International Conference on Neural Information Processing (ICONIP 2020).

  • Memetic Genetic Algorithm... - Download
    Friedrich, Tobias; Krejca, Martin S.; Lagodzinski, J. A. Gregor; Rizzo, Manuel; Zahn, Arthur Memetic Genetic Algorithms for Time Series Compression by Piecewise Linear ApproximationInternational Conference on Neural Information Processing (ICONIP) 2020: 592–604
     

Project Team

The bachelor project is organized by the Algorithm Engineering group. The following group members and students are participating:

Project Supervisor

Hasso Plattner Institute

Office: A-1.10
Tel.: +49 331 5509-410
E-Mail: Friedrich(at)hpi.de

Project Supervisor

Hasso Plattner Institute

Office: A-1.12
Tel.: +493315509-418
E-Mail: Timo.Koetzing@hpi.de

Project Supervisor

 

Hasso Plattner Institute

Office: A-1.11
Tel.: +493315509-4841
E-Mail: Manuel.Rizzo@hpi.de

Advisor

Hasso Plattner Institute

Office: A-1.13
Tel.: +49 331 5509-416
E-Mail: Martin.Schirneck(at)hpi.de

Jorin Heide

Participant

Hasso Plattner Institute

E-Mail: Jorin.Heide(at)student.hpi.de

Linus Heinzl

Participant

Hasso Plattner Institute

E-Mail: Linus.Heinzl(at)student.hpi.de

Nicolas Klodt

Participant

Hasso Plattner Institute

E-Mail: Nicolas.Klodt(at)student.hpi.de

Felix Mujkanovic

Participant

Hasso Plattner Institute

E-Mail: Felix.Mujkanovic@student.hpi.de

Lars Seifert

Participant

Hasso Plattner Institute

E-Mail: Lars.Seifert(at)student.hpi.de

Arthur Zahn

Participant

Hasso Plattner Institute

E-Mail: Arthur.Zahn(at)student.hpi.de