We are excited to announce that the paper "JET: Fast Estimation of Hierarchical Time Series Clustering" was accepted to be presented at the 10th International conference on Time Series and Forecasting (ITISE).
Authors:
Phillip Wenig (Hasso Plattner Institute)
Mathias Höfgen (Rolls-Royce)
Thorsten Papenbrock (Philipps University of Marburg)
Abstract:
Clustering is an effective, unsupervised classification approach for time series analysis applications that suffer a natural lack of training data. One such application is the development of jet engines that involves numerous test runs and failure detection processes. While effective data mining algorithms exist for the detection of anomalous and structurally conspicuous test recordings, these algorithms do not perform any semantic labelling. So data analysts spend many hours connecting the large amounts of automatically extracted observations to their underlying root causes. The complexity, amount and variety of extracted time series make this task hard not only for humans, but also for existing time series clustering algorithms: These algorithms either require training data for supervised learning, cannot deal with varying time series lengths, or suffer from exceptionally long runtimes. In this paper, we propose JET, an unsupervised, highly efficient clustering algorithm for large amounts of variable-lengths time series. The main idea is to transform the input time series into a metric space, then apply a very fast conventional clustering algorithm to obtain an effective, but rather coarse-grained pre-clustering of the data; this pre-clustering serves to subsequently estimate the more accurate but also more costly shape-based distances of the time series and, thus, enables JET to apply a highly effective Hierarchical Clustering algorithm on the entire input time series collection. Our experiments demonstrate that JET is highly accurate and much faster than its competitors.