Lan Jiang

Former Ph.D. student at the Information Systems Research Group

Contact Information

Prof.-Dr.-Helmert-Straße 2-3
D-14482 Potsdam
Room: F-2.05

Phone: +49 331 5509 1349

Email: Lan Jiang

Research Interests

Data Preparation
Metadata Detection
Data Profiling

Teaching

Seminars:

Advanced Data Profiling (WS2017/18)
Data Preparation for Science (WS2018/19)

Master thesis:

ExtracTable: Extracting Tables From Plain-Text Files (Leonardo Hübscher, 2021)

Publications

Lan Jiang, Gerardo Vitagliano, Mazhar Hameed, Felix Naumann: "Aggregation Detection in CSV Files". Proceedings of the International Conference on Extending Database Technology (EDBT), 2022 (to appear)
Mazhar Hameed, Gerardo Vitagliano, Lan Jiang, Felix Naumann: "SURAGH: Syntactic Pattern Matching to Identify Ill-Formed Records". Proceedings of the International Conference on Extending Database Technology (EDBT), 2022 (under revision)
Gerardo Vitagliano, Lan Jiang, Felix Naumann: "Detecting Layout Templates in Complex Multiregion Files". PVLDB. Accepted (2021).
Lan Jiang, Gerardo Vitagliano, Felix Naumann: "Structure Detection in Verbose CSV Files". Proceedings of the International Conference on Extending Database Technology (EDBT), 193–204, 2021
Koumarelas, Ioannis, Lan Jiang, and Felix Naumann. "Data Preparation for Duplicate Detection". Journal of Data and Information Quality (JDIQ) 12, no. 3 (2020): 1–24.
Lan Jiang and Felix Naumann. "Holistic Primary Key and Foreign Key Detection". Journal of Intelligent Information Systems 54, no. 3 (2020): 439–461.
Lan Jiang, Gerardo Vitagliano, Felix Naumann: "A Scoring-based Approach for Data Preparator Suggestion". Lernen, Wissen, Daten, Analysen (LWDA), 2454:6–9, 2019
Dürsch, Falco, Axel Stebner, Fabian Windheuser, Maxi Fischer, Tim Friedrich, Nils Strelow, Tobias Bleifuß, Hazar Harmouch, Lan Jiang, Thorsten Papenbrock, and Felix Naumann. "Inclusion Dependency Discovery: An Experimental Evaluation of Thirteen Algorithms". In Proceedings of the International Conference on Information and Knowledge Management (CIKM), 219–228, 2019.
Lan Jiang, Hengyang Lu, Ming Xu, and Chongjun Wang. "Biterm Pseudo Document Topic Model for Short Text". In IEEE International Conference on Tools With Artificial Intelligence, 865–872. IEEE, 2016.
Yang Jun, Lan Jiang, Chongjun Wang, and Junyuan Xie. "Multi-Label Emotion Classification for Tweets in Weibo: Method and Application". In IEEE International Conference on Tools With Artificial Intelligence, 424–428. IEEE, 2014.
