Sedir Mohammed & Sebastian Schmidl
Title: Experiment Data Management
Abstract: As computer science researchers, we develop software systems to solve problems and execute many experiments to test and evaluate our systems. These experiments use different versions of our software, evaluate the software on many datasets, explore various settings, generate intermediate data, and hopefully produce results. Our experiments, thus, create massive amounts of experimental data, which needs to be stored and managed. In this mess, we may quickly lose track of particular results, their configuration, and whether the results are still up-to-date. How do we effectively manage our experimental data? In this talk, we propose a way to manage and query our experimental results using database technology. Based on two example setups, we show the benefits (and drawbacks) of storing experimental data within a database, such as querying using SQL, storage efficiency, access synchronization, and, in general, a better overview of all experiments and their history.