Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
 

Contact Information
Prof.-Dr.-Helmert-Straße 2-3 D-14482 Potsdam Room: F-2.05
Phone: +49 331 5509 274
Email: mazhar.hameed(at)hpi.de

Reserach Interests

  • Data Preparation
  • Dialect Detection & Correction
  • Record Structure Detection & Preparation
  • File Structure Detection & Preparation

Projects

  • Survey -  Data preparation from industry perspective: A survey
  • Suragh -  Detecting ill-formed Records in CSV Files
  • Tasheeh - Cleaning ill-formed Records in CSV Files

Publications

  • M. Hameed, G. Vitagliano, F. Panse, F. Naumann: TASHEEH: Repairing Row-Structure in Raw CSV Files, Proceedings of the International Conference on Extending Database Technology (EDBT), 2024
  • M. Hameed, G. Vitagliano, F. Naumann: MORPHER: Structural Transformation of ill-formed Rows, Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2023
  • G. Vitagliano, M. Hameed, L. Reisener, L. Jiang, E. Wu, F. Naumann: Pollock: A Data Loading Benchmark, Proceedings of the VLDB Endowment (PVLDB), 2023.
  • G. Vitagliano, M. Hameed, F. Naumann: Structural embedding of data files with MaGRiTTE. Table Representation Learning Workshop at NeurIPS (TRL@NeurIPS), 2022.
  • G. Vitagliano, L. Reisener, L. Jiang, M. Hameed, F. Naumann: Mondrian: Spreadsheet Layout Detection. Proceedings of the International Conference on Management of Data (SIGMOD), 2022 
  • M. Hameed, G. Vitagliano, L. Jiang, F. Naumann: SURAGH: Syntactic Pattern Matching to Identify Ill-Formed Records.  Proceedings of the International Conference on Extending Database Technology (EDBT), 2022 
  • L. Jiang, G. Vitagliano, M. Hameed, F. Naumann: Aggregation Detection in CSV Files.  Proceedings of the International Conference on Extending Database Technology (EDBT), 2022 
  • M. Hameed, F. Naumann: Data Preparation: A Survey of Commercial Tools.  SIGMOD Record 49:(3), 2020