Data Quality – The Role of Empiricism
Shazia Sadiq, Juliana Freire, Renée J. Miller, Tamraparni Dasu, Ihab F. Ilyas, Felix Naumann, Divesh Srivastava, Xin Luna Dong, Sebastian Link, Xiaofang Zhou
Our paper "Data Quality – The Role of Empiricism" has been accepted for publication in SIGMOD Record, December 2017, Vol. 46, No. 4.
We outline a call to action for promoting empiricism in data quality research. The action points result from an analysis of the landscape of data quality research. The landscape exhibits two dimensions of empiricism in data quality research relating to type of metrics and scope of method. Our study indicates the presence of a data continuum ranging from real to synthetic data, which has implications for how data quality methods are evaluated. The dimensions of empiricism and their inter-relationships provide a means of positioning data quality research, and help expose limitations, gaps and opportunities.