Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 

Publications (sorted in reverse chronological order)

2021

  • 1.
    Risch, J., Hager, P., Krestel, R.: Multifaceted Domain-Specific Document Embeddings. Proceedings of the 19th Annual Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations) (NAACL). ACL (2021).
     
  • 2.
    Risch, J., Repke, T., Kohlmeyer, L., Krestel, R.: ComEx: Comment Exploration on Online News Platforms. Joint Proceedings of the ACM IUI 2021 Workshops co-located with the 26th ACM Conference on Intelligent User Interfaces (IUI). pp. 1–7. CEUR-WS.org (2021).
     
  • 3.
    Risch, J., Alder, N., Hewel, C., Krestel, R.: PatentMatch: A Dataset for Matching Patent Claims & Prior Art. Proceedings of the 2nd Workshop on Patent Text Mining and Semantic Technologies (PatentSemTech@SIGIR) (2021).
     
  • 4.
    Panse, F., Naumann, F.: Evaluation of Duplicate Detection Algorithms: From Quality Measures to Test Data Generation (tutorial). Proceedings of the International Conference on Data Engineering (ICDE). pp. 2373–2376 (2021).
     
  • 5.
    Risch, J., Schmidt, P., Krestel, R.: Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format. Proceedings of the Workshop on Online Abuse and Harms (WOAH@ACL). pp. 157–163 (2021).
     
  • 6.
    Jain, N., Kalo, J.-C., Balke, W.-T., Krestel, R.: Do Embeddings Actually Capture Knowledge Graph Semantics? Extended Semantic Web Conference (ESWC) 2021. pp. 143–159. Springer (2021).
     
  • 7.
    Bleifuß, T., Bornemann, L., Kalashnikov, D.V., Naumann, F., Srivastava, D.: Structured Object Matching across Web Page Revisions. IEEE International Conference on Data Engineering (ICDE). pp. 1284–1295 (2021).
     
  • 8.
    Loster, M., Mottin, D., Papotti, P., Naumann, F., Ehmueller, J., Feldmann, B.: Few-Shot Knowledge Validation using Rules. Proceedings of the Web Conference (2021).
     
  • 9.
    Jiang, L., Vitagliano, G., Naumann, F.: Structure Detection in Verbose CSV Files. International Conference on Extending Database Technology (EDBT). pp. 193–204 (2021).
     
  • 10.
    Repke, T., Krestel, R.: Robust Visualisation of Dynamic Text Collections: Measuring and Comparing Dimensionality Reduction Algorithms. ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR). pp. 1–4 (2021).
     
  • 11.
    Repke, T., Krestel, R.: Extraction and Representation of Financial Entities from Text. In: Consoli, S., Reforgiato Recupero, D., and Saisana, M. (eds.) Data Science for Economics and Finance. pp. 241–263. Springer, Cham (2021).
     
  • 12.
    Weise, J., Schmidl, S., Papenbrock, T.: Optimized Theta-Join Processing. In: Sattler, K.-U., Herschel, M., and Lehner, W. (eds.) Proceedings of the Conference on Database Systems for Business, Technology, and Web (BTW). pp. 59–78. Gesellschaft für Informatik, Bonn (2021).
     
  • 13.
    Harmouch, H., Papenbrock, T., Naumann, F.: Relational Header Discovery using Similarity Search in a Table Corpus. IEEE 37th International Conference on Data Engineering (ICDE). (2021).
     
  • 14.
    Kossmann, J., Papenbrock, T., Naumann, F.: Data dependencies for query optimization: a survey. VLDB Journal. (2021).
     
  • 15.
    Schneider, J., Wenig, P., Papenbrock, T.: Distributed detection of sequential anomalies in univariate time series. The International Journal on Very Large Data Bases. (2021).
     
  • 16.
    Alder, N., Bleifuß, T., Bornemann, L., Naumann, F., Repke, T.: Ein Data Engineering Kurs für 10.000 Teilnehmer. Datenbank-Spektrum. 20, 5–9 (2021).
     
  • 17.
    Schwanhold, R., Repke, T., Krestel, R.: Modeling the Evolution of Word Senses with Force-Directed Layouts of Co-occurrence Networks. Proceedings of the 2nd International Workshop on Computational Approaches to Historical Language Change (LChange@ACL 2021). pp. 1–6 (2021).
     
  • 18.
    Belaid, M.K., Rabus, M., Krestel, R.: CrashNet: an encoder–decoder architecture to predict crash test outcomes. Data Mining and Knowledge Discovery. (2021).
     
  • 19.
    Loster, M., Koumarelas, I., Naumann, F.: Knowledge Transfer for Entity Resolution with Siamese Neural Networks. Journal of Data and Information Quality. 13, (2021).
     
  • 20.
    Schmidl, S., Papenbrock, T.: Efficient Distributed Discovery of Bidirectional Order Dependencies. The VLDB Journal. (2021).
     

2020

  • 1.
    Risch, J., Krestel, R.: Bagging BERT Models for Robust Aggression Identification. Proceedings of the Workshop on Trolling, Aggression and Cyberbullying (TRAC@LREC). pp. 55–61. European Language Resources Association (ELRA) (2020).
     
  • 2.
    Risch, J., Künstler, V., Krestel, R.: HyCoNN: Hybrid Cooperative Neural Networks for Personalized News Discussion Recommendation. Proceedings of the International Joint Conferences on Web Intelligence and Intelligent Agent Technologies (WI-IAT) (2020).
     
  • 3.
    Jain, N., Bartz, C., Bredow, T., Metzenthin, E., Otholt, J., Krestel, R.: Semantic Analysis of Cultural Heritage Data: Aligning Paintings and Descriptions in Art-Historic Collections. International Workshop on Fine Art Pattern Extraction and Recognition in conjunction with the 25th International Conference on Pattern Recognition (ICPR 2020) (2020).
     
  • 4.
    Lazaridou, K., Löser, A., Mestre, M., Naumann, F.: Discovering Biased News Articles Leveraging Multiple Human Annotations. Proceedings of the Conference on Language Resources and Evaluation (LREC). pp. 1268–1277 (2020).
     
  • 5.
    Jain, N.: Domain-Specific Knowledge Graph Construction for Semantic Analysis. Extended Semantic Web Conference (ESWC 2020) Ph.D. Symposium (2020).
     
  • 6.
    Risch, J., Garda, S., Krestel, R.: Hierarchical Document Classification as a Sequence Generation Task. Proceedings of the Joint Conference on Digital Libraries (JCDL). pp. 147–155 (2020).
     
  • 7.
    Risch, J., Ruff, R., Krestel, R.: Offensive Language Detection Explained. Proceedings of the Workshop on Trolling, Aggression and Cyberbullying (TRAC@LREC). pp. 137–143. European Language Resources Association (ELRA) (2020).
     
  • 8.
    Pena, E.H.M., Filho, E.R.L., de Almeida, E.C., Naumann, F.: Efficient Detection of Data Dependency Violations. Proceedings of the International Conference on Information and Knowledge Management (CIKM) (2020).
     
  • 9.
    Risch, J., Krestel, R.: A Dataset of Journalists’ Interactions with Their Readership: When Should Article Authors Reply to Reader Comments? Proceedings of the International Conference on Information and Knowledge Management (CIKM). pp. 3117–3124. ACM (2020).
     
  • 10.
    Bornemann, L., Bleifuß, T., Kalashnikov, D.V., Naumann, F., Srivastava, D.: Natural Key Discovery in Wikipedia Tables. Proceedings of The World Wide Web Conference (WWW). pp. 2789–2795 (2020).
     
  • 11.
    Bejnordi, A.E., Krestel, R.: Dynamic Channel and Layer Gating in Convolutional Neural Networks. Proceedings of the 43rd German Conference on Artificial Intelligence (KI 2020) (2020).
     
  • 12.
    Ehmüller, J., Kohlmeyer, L., McKee, H., Paeschke, D., Repke, T., Krestel, R., Naumann, F.: Sense Tree: Discovery of New Word Senses with Graph-based Scoring. Proceedings of the Conference on "Lernen, Wissen, Daten, Analysen" (LWDA). pp. 1–12 (2020).
     
  • 13.
    Risch, J., Krestel, R.: Top Comment or Flop Comment? Predicting and Explaining User Engagement in Online News Discussions. Proceedings of the International Conference on Web and Social Media (ICWSM). pp. 579–589. AAAI (2020).
     
  • 14.
    Jain, N., Bartz, C., Krestel, R.: Automatic Matching of Paintings and Descriptions in Art-Historic Archives using Multimodal Analysis. 1st International Workshop on Artificial Intelligence for Historical Image Enrichment and Access (AI4HI-2020), co-located with LREC 2020 conference (2020).
     
  • 15.
    Jain, N., Krestel, R.: Learning Fine-Grained Semantics for Multi-Relational Data. International Semantic Web Conference, 2020 Posters and Demos (2020).
     
  • 16.
    Risch, J., Krestel, R.: Toxic Comment Detection in Online Discussions. In: Agarwal, B., Nayak, R., Mittal, N., and Patnaik, S. (eds.) Deep Learning-Based Approaches for Sentiment Analysis. pp. 85–109. Springer (2020).
     
  • 17.
    Risch, J., Ruff, R., Krestel, R.: Explaining Offensive Language Detection. Journal for Language Technology and Computational Linguistics (JLCL). 34, 29–47 (2020).
    publisher: German Society for Computational Linguistics and Language Technology (GSCL)
     
  • 18.
    Schirmer, P., Papenbrock, T., Koumarelas, I., Naumann, F.: Efficient Discovery of Matching Dependencies. ACM Transactions on Database Systems (TODS). 45, 1–33 (2020).
     
  • 19.
    Hameed, M., Naumann, F.: Data Preparation: A Survey of Commercial Tools. SIGMOD Record. 49, (2020).
     
  • 20.
    Kruse, S., Kaoudi, Z., Quiané-Ruiz, J.-A., Chawla, S., Naumann, F., Contreras-Rojas, B.: RHEEMix in the Data Jungle: A Cost-based Optimizer for Cross-Platform Systems. VLDB Journal. 29, 1287–1310 (2020).
     
  • 21.
    Risch, J., Alder, N., Hewel, C., Krestel, R.: PatentMatch: A Dataset for Matching Patent Claims with Prior Art. ArXiv e-prints 2012.13919. (2020).
     
  • 22.
    Birnick, J., Bläsius, T., Friedrich, T., Naumann, F., Papenbrock, T., Schirneck, M.: Hitting Set Enumeration with Partial Information for Unique Column Combination Discovery. Proceedings of the VLDB Endowment. 13, 2270–2283 (2020).
     
  • 23.
    Jiang, L., Naumann, F.: Holistic Primary Key and Foreign Key Detection. Journal of Intelligent Information Systems. 54, 439–461 (2020).
     
  • 24.
    Koumarelas, I., Papenbrock, T., Naumann, F.: MDedup: Duplicate Detection with Matching Dependencies. Proceedings of the VLDB Endowment (PVLDB). 13, 712–725 (2020).
     
  • 25.
    Koumarelas, I., Jiang, L., Naumann, F.: Data Preparation for Duplicate Detection. Journal of Data and Information Quality (JDIQ). 12, 1–24 (2020).
     
  • 26.
    Repke, T., Krestel, R.: Exploration Interface for Jointly Visualised Text and Graph Data. International Conference on Intelligent User Interfaces Companion (IUI ’20). (2020).
     
  • 27.
    Hacker, P., Krestel, R., Grundmann, S., Naumann, F.: Explainable AI under Contract and Tort Law: Legal Incentives and Technical Challenges. Artificial Intelligence and Law. (2020).
     
  • 28.
    Repke, T., Krestel, R.: Visualising Large Document Collections by Jointly Modeling Text and Network Structure. Proceedings of the Joint Conference on Digital Libraries (JCDL). (2020).
     
  • 29.
    Caruccio, L., Deufemia, V., Naumann, F., Polese, G.: Discovering Relaxed Functional Dependencies based on Multi-attribute Dominance. Transactions on Knowledge and Data Engineering (TKDE). (2020).
     

2019

  • 1.
    Kruse, S., Kaoudi, Z., Quiané-Ruiz, J.-A., Chawla, S., Naumann, F., Contreras-Rojas, B.: Optimizing Cross-Platform Data Movement. Proceedings of the International Conference on Data Engineering (ICDE). pp. 1642–1645 (2019).
     
  • 2.
    Bleifuß, T., Bornemann, L., Kalashnikov, D.V., Naumann, F., Srivastava, D.: DBChEx: Interactive Exploration of Data and Schema Change. Proceedings of the Conference on Innovative Data Systems Research (CIDR) (2019).
     
  • 3.
    Schirmer, P., Papenbrock, T., Kruse, S., Naumann, F., Hempfing, D., Mayer, T., Neuschäfer-Rube, D.: DynFD: Functional Dependency Discovery in Dynamic Datasets. Proceedings of the International Conference on Extending Database Technology (EDBT). pp. 253–264 (2019).
     
  • 4.
    Jain, N., Krestel, R.: Who is Mona L.? Identifying Mentions of Artworks in Historical Archives. International Conference on Theory and Practice of Digital Libraries (TPDL 2019). pp. 115–122. Springer (2019).
     
  • 5.
    Razniewski, S., Jain, N., Mirza, P., Weikum, G.: Coverage of Information Extraction from Sentences and Paragraphs. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (2019).
     
  • 6.
    Dürsch, F., Stebner, A., Windheuser, F., Fischer, M., Friedrich, T., Strelow, N., Bleifuß, T., Harmouch, H., Jiang, L., Papenbrock, T., Naumann, F.: Inclusion Dependency Discovery: An Experimental Evaluation of Thirteen Algorithms. Proceedings of the International Conference on Information and Knowledge Management (CIKM). pp. 219–228 (2019).
     
  • 7.
    Jiang, L., Vitagliano, G., Naumann, F.: A Scoring-based Approach for Data Preparator Suggestion. Lernen, Wissen, Daten, Analysen (LWDA). pp. 6–9 (2019).
     
  • 8.
    Risch, J., Stoll, A., Ziegele, M., Krestel, R.: hpiDEDIS at GermEval 2019: Offensive Language Identification using a German BERT model. Proceedings of the 15th Conference on Natural Language Processing (KONVENS). pp. 403–408. German Society for Computational Linguistics & Language Technology, Erlangen, Germany (2019).
     
  • 9.
    Schmidl, S., Schneider, F., Papenbrock, T.: An Actor Database System for Akka. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW) - Workshopband. pp. 225–234 (2019).
     
  • 10.
    Naumann, F.: The relational database management systems genealogy. In: Brodie, M.L. (ed.) Making Databases Work. pp. 173–179. ACM / Morgan & Claypool (2019).
     
  • 11.
    Risch, J., Krestel, R.: Domain-specific word embeddings for patent classification. Data Technologies and Applications. 53, 108–122 (2019).
     
  • 12.
    Risch, J., Krestel, R.: Measuring and Facilitating Data Repeatability in Web Science. Datenbank-Spektrum. 19, 117–126 (2019).
     
  • 13.
    Kellermeier, T., Repke, T., Krestel, R.: Mining Business Relationships from Stocks and News. MIDAS@ECML-PKDD. (2019).
     
  • 14.
    Pena, E.H.M., de Almeida, E.C., Naumann, F.: Discovery of Approximate (and Exact) Denial Constraints. PVLDB. 13, 266–278 (2019).
     
  • 15.
    Draisbach, U., Christen, P., Naumann, F.: Transforming Pairwise Duplicates to Entity Clusters for High Quality Duplicate Detection. ACM Journal on Data and Information Quality (JDIQ). 12, 1–30 (2019).
     

2018

  • 1.
    Risch, J., Krestel, R.: Delete or not Delete? Semi-Automatic Comment Moderation for the Newsroom. Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (co-located with COLING). pp. 166–176 (2018).
     
  • 2.
    Repke, T., Krestel, R.: Bringing Back Structure to Free Text Email Conversations with Recurrent Neural Networks. 40th European Conference on Information Retrieval (ECIR 2018). Springer, Grenoble, France (2018).
     
  • 3.
    Risch, J., Krestel, R.: Learning Patent Speak: Investigating Domain-Specific Word Embeddings. Proceedings of the Thirteenth International Conference on Digital Information Management (ICDIM). pp. 63–68 (2018).
     
  • 4.
    Risch, J., Krebs, E., Löser, A., Riese, A., Krestel, R.: Fine-Grained Classification of Offensive Language. Proceedings of GermEval (co-located with KONVENS). pp. 38–44 (2018).
     
  • 5.
    Risch, J., Garda, S., Krestel, R.: Book Recommendation Beyond the Usual Suspects: Embedding Book Plots Together with Place and Time Information. Proceedings of the 20th International Conference On Asia-Pacific Digital Libraries (ICADL). pp. 227–239 (2018).
     
  • 6.
    Loster, M., Hegner, M., Naumann, F., Leser, U.: Dissecting Company Names using Sequence Labeling. Proceedings of the Conference "Lernen, Wissen, Daten, Analysen". pp. 227–238 (2018).
     
  • 7.
    Pietrangelo, A., Simonini, G., Bergamaschi, S., Naumann, F., Koumarelas, I.: Towards Progressive Search-driven Entity Resolution. Italian Symposium on Advanced Database Systems (SEBD) (2018).
     
  • 8.
    Loster, M., Naumann, F., Ehmueller, J., Feldmann, B.: CurEx: A System for Extracting, Curating, and Exploring Domain-Specific Knowledge Graphs from Text. Proceedings of the ACM International Conference on Information and Knowledge Management. pp. 1883–1886. ACM (2018).
     
  • 9.
    Repke, T., Krestel, R., Edding, J., Hartmann, M., Hering, J., Kipping, D., Schmidt, H., Scordialo, N., Zenner, A.: Beacon in the Dark: A System for Interactive Exploration of Large Email Corpora. Proceedings of the International Conference on Information and Knowledge Management (CIKM). pp. 1–4. ACM (2018).
     
  • 10.
    Loster, M., Repke, T., Krestel, R., Naumann, F., Ehmueller, J., Feldmann, B., Maspfuhl, O.: The Challenges of Creating, Maintaining and Exploring Graphs of Financial Entities. Proceedings of the Fourth International Workshop on Data Science for Macro-Modeling (DSMM 2018). ACM (2018).
     
  • 11.
    van Aken, B., Risch, J., Krestel, R., Löser, A.: Challenges for Toxic Comment Classification: An In-Depth Error Analysis. Proceedings of the 2nd Workshop on Abusive Language Online (co-located with EMNLP). pp. 33–42 (2018).
     
  • 12.
    Risch, J., Krestel, R.: Aggression Identification Using Deep Learning and Data Augmentation. Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (co-located with COLING). pp. 150–158 (2018).
     
  • 13.
    Exeler, C., Graber, M., Junge, T., Ramson, S., Ramson, C., Tschirschnitz, F., Naumann, F.: Piggyback Profiling: Enhancing Query Results with Metadata. Lernen. Wissen. Daten. Analysen. (LWDA) (2018).
     
  • 14.
    Risch, J., Krestel, R.: My Approach = Your Apparatus? Entropy-Based Topic Modeling on Multiple Domain-Specific Text Collections. Proceedings of the 18th ACM/IEEE Joint Conference on Digital Libraries (JCDL). pp. 283–292 (2018).
     
  • 15.
    Berti-Equille, L., Harmouch, H., Naumann, F., Novelli, N., Thirumuruganathan, S.: Discovery of Genuine Functional Dependencies from Relational Data with Missing Values. Proceedings of the VLDB Endowment (PVLDB). pp. 880–892 (2018).
     
  • 16.
    Lazaridou, K., Gruetze, T., Naumann, F.: Where in the World Is Carmen Sandiego? Detecting Person Locations via Social Media Discussions. Proceedings of the ACM Conference on Web Science. ACM (2018).
     
  • 17.
    Ambroselli, C., Risch, J., Krestel, R., Loos, A.: Prediction for the Newsroom: Which Articles Will Get the Most Comments? Proceedings of the 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). pp. 193–199. ACL, New Orleans, Louisiana, USA (2018).
     
  • 18.
    Bunk, S., Krestel, R.: WELDA: Enhancing Topic Models by Incorporating Local Word Contexts. Joint Conference on Digital Libraries (JCDL 2018). ACM, Fort Worth, Texas, USA (2018).
     
  • 19.
    Abedjan, Z., Golab, L., Naumann, F., Papenbrock, T.: Data Profiling. Morgan & Claypool Publishers (2018).
     
  • 20.
    Bornemann, L., Bleifuß, T., Kalashnikov, D., Naumann, F., Srivastava, D.: Data Change Exploration using Time Series Clustering. Datenbank-Spektrum. 18, 1–9 (2018).
     
  • 21.
    Sadiq, S., Dasu, T., Dong, X.L., Freire, J., Ilyas, I.F., Link, S., Miller, R.J., Naumann, F., Zhou, X., Srivastava, D.: Data Quality – The Role of Empiricism. SIGMOD Record. 46, 35–43 (2018).
     
  • 22.
    Repke, T., Krestel, R.: Topic-aware Network Visualisation to Explore Large Email Corpora. International Workshop on Big Data Visual Exploration and Analytics (BigVis). (2018).
     
  • 23.
    Kruse, S., Naumann, F.: Efficient Discovery of Approximate Dependencies. Proceedings of the VLDB Endowment. 11, 759–772 (2018).
    See abstract for errata
     
  • 24.
    Koumarelas, I., Kroschk, A., Mosley, C., Naumann, F.: Experience: Enhancing Address Matching with Geocoding and Similarity Measure Selection. Journal of Data and Information Quality (JDIQ). 10, 8:1–8:16 (2018).
     
  • 25.
    Bleifuß, T., Bornemann, L., Johnson, T., Kalashnikov, D.V., Naumann, F., Srivastava, D.: Exploring Change - A New Dimension of Data Analytics. Proceedings of the VLDB Endowment (PVLDB). 12, 85–98 (2018).
     
  • 26.
    Agrawal, D., Chawla, S., Kaoudi, Z., Kruse, S., Quiané-Ruiz, J.-A., Contreras-Rojas, B., Elmagarmid, A., Idris, Y., Lucas, J., Mansour, E., Ouzzani, M., Papotti, P., Tang, N., Thirumuruganathan, S., Troudi, A.: RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -. Proceedings of the VLDB Endowment (PVLDB). 11, (2018).
     

2017

  • 1.
    Kruse, S., Hahn, D., Walter, M., Naumann, F.: Metacrate: Organize and Analyze Millions of Data Profiles. Proceedings of the International Conference on Information and Knowledge Management (CIKM). pp. 2483–2486. ACM (2017).
     
  • 2.
    Repke, T., Loster, M., Krestel, R.: Comparing Features for Ranking Relationships Between Financial Entities Based on Text. Proceedings of the 3rd International Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets. pp. 12:1–12:2. ACM, New York, NY, USA (2017).
     
  • 3.
    Lazaridou, K., Krestel, R., Naumann, F.: Identifying Media Bias by Analyzing Reported Speech. International Conference on Data Mining. IEEE (2017).
     
  • 4.
    Maschler, F., Niephaus, F., Risch, J.: Real or Fake? Large-Scale Validation of Identity Leaks. 47. Jahrestagung der Gesellschaft für Informatik (INFORMATIK). pp. 2437–2448 (2017).
     
  • 5.
    Harmouch, H., Naumann, F.: Cardinality Estimation: An Experimental Survey. Proceedings of the VLDB Endowment (PVLDB). pp. 499–512 (2017).
     
  • 6.
    Krestel, R., Risch, J.: How Do Search Engines Work? A Massive Open Online Course with 4000 Participants. Proceedings of the Conference Lernen, Wissen, Daten, Analysen. pp. 259–271 (2017).
     
  • 7.
    Abedjan, Z., Golab, L., Naumann, F.: Data Profiling (tutorial). Proceedings of the International Conference on Management of Data (SIGMOD) (2017).
     
  • 8.
    Risch, J., Krestel, R.: What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers. Proceedings of the International Conference on Theory and Practice of Digital Libraries (TPDL). pp. 40–46 (2017).
     
  • 9.
    Papenbrock, T., Naumann, F.: A Hybrid Approach for Efficient Unique Column Combination Discovery. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW). pp. 195–204 (2017).
     
  • 10.
    Papenbrock, T., Naumann, F.: Data-driven Schema Normalization. Proceedings of the International Conference on Extending Database Technology (EDBT). pp. 342–353 (2017).
     
  • 11.
    Kruse, S., Papenbrock, T., Dullweber, C., Finke, M., Hegner, M., Zabel, M., Zöllner, C., Naumann, F.: Fast Approximate Discovery of Inclusion Dependencies. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW). pp. 207–226 (2017).
     
  • 12.
    Gruetze, T., Krestel, R., Lazaridou, K., Naumann, F.: What was Hillary Clinton doing in Katy, Texas? Proceedings of the 26th International Conference on World Wide Web, WWW 2017, Perth, Australia, 3-7 April, 2017. ACM (2017).
     
  • 13.
    Loster, M., Zuo, Z., Naumann, F., Maspfuhl, O., Thomas, D.: Improving Company Recognition from Unstructured Text by using Dictionaries. Proceedings of the International Conference on Extending Database Technology. pp. 610–619 (2017).
     
  • 14.
    Bleifuß, T., Johnson, T., Kalashnikov, D.V., Naumann, F., Shkapenyuk, V., Srivastava, D.: Enabling Change Exploration (Vision). Proceedings of the Fourth International Workshop on Exploratory Search in Databases and the Web (ExploreDB). pp. 1–3 (2017).
     
  • 15.
    Zuo, Z., Loster, M., Krestel, R., Naumann, F.: Uncovering Business Relationships: Context-sensitive Relationship Extraction for Difficult Relationship Types. Proceedings of the Conference "Lernen, Wissen, Daten, Analysen" (LWDA) (2017).
     
  • 16.
    Giesler, M.J., Keller, B., Repke, T., Leonhart, R., Weis, J., Muckelbauer, R., Rieckmann, N., Müller-Nordhorn, J., Lucius-Hoene, G., Holmberg, C.: Effect of a Website That Presents Patients’ Experiences on Self-Efficacy and Patient Competence of Colorectal Cancer Patients: Web-Based Randomized Controlled Trial. J Med Internet Res. 19, e334 (2017).
     
  • 17.
    Tschirschnitz, F., Papenbrock, T., Naumann, F.: Detecting Inclusion Dependencies on Very Many Tables. ACM Transactions on Database Systems (TODS). 42, 18:1–18:29 (2017).
     
  • 18.
    Heller, D., Krestel, R., Ohler, U., Vingron, M., Marsico, A.: ssHMM: Extracting Intuitive Sequence-Structure Motifs from High-Throughput RNA-Binding Protein Data. Nucleic Acids Research. 45, 11004–11018 (2017).
     
  • 19.
    Bleifuß, T., Kruse, S., Naumann, F.: Efficient Denial Constraint Discovery with Hydra. Proceedings of the VLDB Endowment (PVLDB). 11, 311–323 (2017).
     
  • 20.
    Naumann, F., Krestel, R.: Das Fachgebiet „Informationssysteme“ am Hasso-Plattner-Institut. Datenbank-Spektrum. 17, 69–76 (2017).
     

2016

  • 1.
    Krestel, R., Mottin, D., Müller, E. eds.: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen", Potsdam, Germany, September 12-14, 2016. CEUR-WS.org (2016).
     
  • 2.
    Grundke, M., Jasper, J., Perchyk, M., Sachse, J.P., Krestel, R., Neves, M.: TextAI: Enhancing TextAE with Intelligent Annotation Support. Proceedings of the 7th International Symposium on Semantic Mining in Biomedicine (SMBM 2016). pp. 80–84. CEUR-WS.org (2016).
     
  • 3.
    Park, J., Blume-Kohout, M., Krestel, R., Nalisnick, E., Smyth, P.: Analyzing NIH Funding Patterns over Time with Statistical Text Analysis. Scholarly Big Data: AI Perspectives, Challenges, and Ideas (SBD 2016) Workshop at AAAI 2016. AAAI (2016).
     
  • 4.
    Ehrlich, J., Roick, M., Schulze, L., Zwiener, J., Papenbrock, T., Naumann, F.: Holistic Data Profiling: Simultaneous Discovery of Various Metadata. Proceedings of the International Conference on Extending Database Technology (EDBT). pp. 305–316. OpenProceedings.org (2016).
     
  • 5.
    Papenbrock, T., Naumann, F.: A Hybrid Approach to Functional Dependency Discovery. Proceedings of the International Conference on Management of Data (SIGMOD). pp. 821–833. ACM, New York, NY, USA (2016).
     
  • 6.
    Abedjan, Z., Golab, L., Naumann, F.: Data Profiling (tutorial). International Conference on Data Engineering (ICDE) (2016).
     
  • 7.
    Jenders, M., Krestel, R., Naumann, F.: Which Answer is Best? Predicting Accepted Answers in MOOC Forums. Proceedings of the 25th International Conference Companion on World Wide Web. pp. 679–684. International World Wide Web Conferences Steering Committee (2016).
     
  • 8.
    Gruetze, T., Krestel, R., Naumann, F.: Topic Shifts in StackOverflow: Ask it like Socrates. Lecture Notes in Computer Science. pp. 213–221. Springer (2016).
     
  • 9.
    Agrawal, D., Ba, L., Berti-Equille, L., Chawla, S., Elmagarmid, A., Hammady, H., Idris, Y., Kaoudi, Z., Khayyat, Z., Kruse, S., Ouzzani, M., Papotti, P., Quiané-Ruiz, J.-A., Tang, N., Zaki, M.J.: Rheem: Enabling Multi-Platform Task Execution (demo). Proceedings of the ACM SIGMOD conference (SIGMOD) (2016).
     
  • 10.
    Godde, C., Lazaridou, K., Krestel, R.: Classification of German Newspaper Comments. Proceedings of the Conference Lernen, Wissen, Daten, Analysen. pp. 299–310. CEUR-WS.org (2016).
     
  • 11.
    Kruse, S., Jentzsch, A., Papenbrock, T., Kaoudi, Z., Quiané-Ruiz, J.-A., Naumann, F.: RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets. Proceedings of the International Conference on Management of Data (SIGMOD). pp. 953–967. ACM, New York, NY, USA (2016).
     
  • Combination of Rule-based... - Download
    12.
    Samiei, A., Koumarelas, I., Loster, M., Naumann, F.: Combination of Rule-based and Textual Similarity Approaches to Match Financial Entities. Data Science for Macro-Modeling with Financial and Economic Datasets (DSMM). ACM (2016).
     
  • 13.
    Samiei, A., Naumann, F.: Cluster-based Sorted Neighborhood for Efficient Duplicate Detection. International Conference on Data Mining Workshops (ICDMW) (2016).
     
  • Approximate Discovery of ... - Download
    14.
    Bleifuß, T., Bülow, S., Frohnhofen, J., Risch, J., Wiese, G., Kruse, S., Papenbrock, T., Naumann, F.: Approximate Discovery of Functional Dependencies for Large Datasets. Proceedings of the International Conference on Information and Knowledge Management (CIKM). pp. 1803–1812. ACM, New York, NY, USA (2016).
     
  • 15.
    Lazaridou, K., Krestel, R.: Identifying Political Bias in News Articles. International Conference on Theory and Practice of Digital Libraries. IEEE Technical Committee on Digital Libraries (2016).
    TPDL Doctoral Consortium
     
  • Data Anamnesis: Admitting... - Download
    16.
    Kruse, S., Papenbrock, T., Harmouch, H., Naumann, F.: Data Anamnesis: Admitting Raw Data into an Organization. IEEE Data Engineering Bulletin. 39, 8–20 (2016).
     
  • CohEEL: Coherent and Effi... - Download
    17.
    Gruetze, T., Kasneci, G., Zuo, Z., Naumann, F.: CohEEL: Coherent and Efficient Named Entity Linking through Random Walks. Web Semantics: Science, Services and Agents on the World Wide Web. 37, 75–89 (2016).
     
  • 18.
    Langer, P., Naumann, F.: Efficient Order Dependency Discovery. VLDB Journal. 25, 223–241 (2016).
     
  • The Information Systems G... - Download
    19.
    Naumann, F., Krestel, R.: The Information Systems Group at HPI. SIGMOD Record. (2016).
     

2015

  • 1.
    Jentzsch, A., Mühleisen, H., Naumann, F.: Uniqueness, Density, and Keyness: Exploring Class Hierarchies. Proceedings of the 6th International Workshop on Consuming Linked Data (COLD 2015), ISWC 2015. Bethlehem, PA, USA (2015).
     
  • 2.
    Jentzsch, A., Dullweber, C., Troiano, P., Naumann, F.: Exploring Linked Data Graph Structures. Proceedings of the Posters and Demos Session, ISWC 2015. Bethlehem, PA, USA (2015).
     
  • Social Media Story Tellin... - Download
    3.
    Hennig, P., Berger, P., Dullweber, C., Finke, M., Maschler, F., Risch, J., Meinel, C.: Social Media Story Telling. Proceedings of the 8th IEEE International Conference on Social Computing and Networking (SocialCom2015). pp. 279–284. Chengdu, China (2015).
     
  • Ergonomic Interaction for... - Download
    4.
    Schmidt, D., Frohnhofen, J., Knebel, S., Meinel, F., Perchyk, M., Risch, J., Striebel, J., Wachtel, J., Baudisch, P.: Ergonomic Interaction for Touch Floors. Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. pp. 3879–3888. ACM, Seoul, Republic of Korea (2015).
     
  • How to Stay Up-to-date on... - Download
    5.
    Roick, M., Jenders, M., Krestel, R.: How to Stay Up-to-date on Twitter with General Keywords. Proceedings of the LWA 2015 Workshops: KDML, FGWM, IR, and FGDB. CEUR-WS.org (2015).
     
  • A Serendipity Model For N... - Download
    6.
    Jenders, M., Lindhauer, T., Kasneci, G., Krestel, R., Naumann, F.: A Serendipity Model For News Recommendation. KI 2015: Advances in Artificial Intelligence - 38th Annual German Conference on AI, Dresden, Germany, September 21-25, 2015, Proceedings. pp. 111–123. Springer (2015).
     
  • Scaling Out the Discovery... - Download
    7.
    Kruse, S., Papenbrock, T., Naumann, F.: Scaling Out the Discovery of Inclusion Dependencies. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW). pp. 445–454 (2015).
     
  • Online Temporal Summariza... - Download
    8.
    Schubotz, T., Krestel, R.: Online Temporal Summarization of News Events. Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT). pp. 679–684. IEEE Computer Society (2015).
     
  • Learning Temporal Tagging... - Download
    9.
    Gruetze, T., Yao, G., Krestel, R.: Learning Temporal Tagging Behaviour. Proceedings of the 24th International Conference on World Wide Web Companion (WWW). pp. 1333–1338. ACM (2015).
     
  • Tweet-Recommender: Findin... - Download
    10.
    Krestel, R., Werkmeister, T., Wiradarma, T.P., Kasneci, G.: Tweet-Recommender: Finding Relevant Tweets for News Articles. Proceedings of the 24th International World Wide Web Conference (WWW). ACM (2015).
     
  • Estimating Data Integrati... - Download
    11.
    Kruse, S., Papotti, P., Naumann, F.: Estimating Data Integration and Cleaning Effort. Proceedings of the International Conference on Extending Database Technology (EDBT) (2015).
     
  • Functional Dependency Dis... - Download
    12.
    Papenbrock, T., Ehrlich, J., Marten, J., Neubert, T., Rudolph, J.-P., Schönberg, M., Zwiener, J., Naumann, F.: Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms. Proceedings of the VLDB Endowment. 8, 1082–1093 (2015).