Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
 

05.02.2018

Data Quality – The Role of Empiricism

Shazia Sadiq, Juliana Freire, Renée J. Miller, Tamraparni Dasu, Ihab F. Ilyas, Felix Naumann, Divesh Srivastava, Xin Luna Dong, Sebastian Link, Xiaofang Zhou

Our paper "Data Quality – The Role of Empiricism" has been accepted for publication in SIGMOD Record, December 2017, Vol. 46, No. 4.

ABSTRACT

We outline a call to action for promoting empiricism in data quality research. The action points result from an analysis of the landscape of data quality research. The landscape exhibits two dimensions of empiricism in data quality research relating to type of metrics and scope of method. Our study indicates the presence of a data continuum ranging from real to synthetic data, which has implications for how data quality methods are evaluated. The dimensions of empiricism and their inter-relationships provide a means of positioning data quality research, and help expose limitations, gaps and opportunities.