Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
 

Gerardo Vitagliano

PhD Student

Prof.-Dr.-Helmert-Straße 2-3
D-14482 Potsdam

Phone: +49 331 5509 427
Room: F-2.05

Email: Gerardo Vitagliano

Research: ResearchGate, GitHub

As a Ph.D. student at the Information Systems Group and member of the HPI Research School, my research interests are on data preparation and information extraction.

I am most active in the research projects of the Data Preparation group, currently focusing on benchmarking data loading and detecting layout templates in multiregion data files.

Feel free to contact me for collaboration, thesis proposals, or anything closely or loosely related to these research interests:

Research Interests

  • Structural Data Preparation
  • Data Pollution
  • Representation Learning
  • Layout Inference in multiregion files

Projects

  • Pollock: A benchmark for CSV data loading.
  • Mondrian: An approach for automatic recognition of layout templates in multiregion files.

Publications

  • G. Vitagliano, L. Jiang, M. Hameed, F. Naumann: Mondrian: Spreadsheet Layout Detection. In preparation
  • G. Vitagliano, L. Jiang, M. Hameed, F. Naumann: Pollock: A Data Loading Benchmark. In preparation
  • M. Hameed, G. Vitagliano, L. Jiang, F. Naumann: SURAGH: Syntactic Pattern Matching to Identify Ill-Formed Records. Proceedings of the International Conference on Extending Database Technology (EDBT) - submitted, 2022
  • L. Jiang, G. Vitagliano, M. Hameed, F. Naumann: Aggregation Detection in CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT) - submitted, 2022
  • G. Vitagliano, L. Jiang, F. Naumann: Detecting Layout Templates in Complex Multiregion Files. PVLDB. Accepted (2021).
  • L. Jiang, G. Vitagliano, F. Naumann: Structure Detection in Verbose CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT), 2021
  • L. Jiang, G. Vitagliano, F. Naumann: A Scoring-based Approach for Data Preparator Suggestion. Lernen, Wissen, Daten, Analysen (LWDA), 2019

Teaching