Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 

Publications

2021


  • Tim Repke, Ralf Krestel: Interactive Curation of Semantic Representations in Digital Libraries. Proceedings of the International Conference on Asia-Pacific Digital Libraries (ICADL), 2021 (to appear)
    [Paper] 
  • Nitisha Jain, Jan-Christoph Kalo, Wolf-Tilo Balke, Ralf Krestel: Do Embeddings Actually Capture Knowledge Graph Semantics?. Proceedings of the International Conference on Principles of Knowledge Representation and Reasoning (KR), 2021 (to appear)
    [ExtendedAbstract] 
  • Tobias Bleifuß, Leon Bornemann, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava: The Secret Life of Wikipedia Tables. Proceedings of the Workshop on Search, Exploration, and Analysis in Heterogeneous Datastores (SEA-Data@VLDB), 2021
    [Paper]  [CEUR-WS]  [Project] 
  • Nitisha Jain, Trung-Kien Tran, Mohamed H. Gad-Elrab, Daria Stepanova: Improving Knowledge Graph Embeddings with Ontological Reasoning. Proceedings of the International Semantic Web Conference (ISWC), 2021 (to appear)
    [Paper] 
  • Sebastian Schmidl, Thorsten Papenbrock: Efficient Distributed Discovery of Bidirectional Order Dependencies. The VLDB Journal (2021)
    [Paper]  [Poster]  [Project Page]  [DOI:10.1007/s00778-021-00683-4]
  • Jan Kossmann, Thorsten Papenbrock, Felix Naumann: Data dependencies for query optimization: a survey. The VLDB Journal (2021) (to appear)
    [Paper] 
  • Julian Risch, Nicolas Alder, Christoph Hewel, Ralf Krestel: PatentMatch: A Dataset for Matching Patent Claims & Prior Art. Proceedings of the Workshop on Patent Text Mining and Semantic Technologies (PatentSemTech@SIGIR), 2021
    [Paper]  [Project Page] 
  • Julian Risch, Philipp Schmidt, Ralf Krestel: Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format. Proceedings of the Workshop on Online Abuse and Harms (WOAH@ACL), 2021
    [Paper]  [GitHub] 
  • Tim Repke, Ralf Krestel: Extraction and Representation of Financial Entities from Text. Data Science for Economics and Finance. Springer, 2021
    [Paper]  [Springer]  [DOI:10.1007/978-3-030-66891-4_11]
  • Mohamed Karim Belaid, Maximilian Rabus, Ralf Krestel: CrashNet: an encoder–decoder architecture to predict crash test outcomes. Data Mining and Knowledge Discovery (2021)
    [Springer]  [DOI:10.1007/s10618-021-00761-9]
  • Robert Schwanhold, Tim Repke, Ralf Krestel: Modeling the Evolution of Word Senses with Force-Directed Layouts of Co-occurrence Networks. Proceedings of the International Workshop on Computational Approaches to Historical Language Change (LChange@ACL), 2021 (to appear)
    [Paper]  [Project] 
  • Fabian Panse, Felix Naumann: Evaluation of Duplicate Detection Algorithms: From Quality Measures to Test Data Generation (tutorial). Proceedings of the International Conference on Data Engineering (ICDE), 2021
    [Paper] 
  • Johannes Schneider, Phillip Wenig, Thorsten Papenbrock: Distributed detection of sequential anomalies in univariate time series. The VLDB Journal (2021)
    [DOI:10.1007/s00778-021-00657-6]
  • Julian Risch, Philipp Hager, Ralf Krestel: Multifaceted Domain-Specific Document Embeddings. Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations)(NAACL), 2021
    [Paper]  [Project Page] 
  • Julian Weise, Sebastian Schmidl, Thorsten Papenbrock: Optimized Theta-Join Processing. Proceedings of the Conference on Database Systems for Business, Technology, and Web (BTW), 2021
    [Paper]  [Project Page]  [DOI:10.18420/btw2021-03]
  • Nitisha Jain, Jan-Christoph Kalo, Wolf-Tilo Balke, Ralf Krestel: Do Embeddings Actually Capture Knowledge Graph Semantics?. Proceedings of the Extended Semantic Web Conference (ESWC), 2021
    [Paper]  [URL]  [DOI:10.1007/978-3-030-77385-4_9]
  • Tobias Bleifuß, Leon Bornemann, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava: Structured Object Matching across Web Page Revisions. Proceedings of the International Conference on Data Engineering (ICDE), 2021
    [Paper]  [IEEE]  [Project]  [DOI:10.1109/ICDE51399.2021.00115]
  • Julian Risch, Tim Repke, Lasse Kohlmeyer, Ralf Krestel: ComEx: Comment Exploration on Online News Platforms. Joint Proceedings of the ACM IUI Workshops co-located with the ACM Conference on Intelligent User Interfaces (IUI), 2021
    [Paper]  [GitHub]  [Project]  [CEUR-WS] 
  • Hazar Harmouch, Thorsten Papenbrock, Felix Naumann: Relational Header Discovery using Similarity Search in a Table Corpus. Proceedings of the International Conference on Data Engineering (ICDE) (2021)
    [DOI:10.1109/ICDE51399.2021.00045]
  • Lan Jiang, Gerardo Vitagliano, Felix Naumann: Structure Detection in Verbose CSV Files. Proceedings of the International Conference on Extending Database Technology (EDBT), 2021
    [Paper]  [GitHub] 
  • Michael Loster, Davide Mottin, Paolo Papotti, Felix Naumann, Jan Ehmueller, Benjamin Feldmann: Few-Shot Knowledge Validation using Rules. Proceedings of the Web Conference, 2021
  • Tim Repke, Ralf Krestel: Robust Visualisation of Dynamic Text Collections: Measuring and Comparing Dimensionality Reduction Algorithms. Proceedings of the Conference on Human Information Interaction and Retrieval (CHIIR), 2021
    [Paper]  [DOI:10.1145/3406522.3446034]
  • Nicolas Alder, Tobias Bleifuß, Leon Bornemann, Felix Naumann, Tim Repke: Ein Data Engineering Kurs für 10.000 Teilnehmer. Datenbank-Spektrum 20:(1), 2021
    [Paper]  [Springer]  [openHPI]  [DOI:10.1007/s13222-020-00354-8]
  • Michael Loster, Ioannis Koumarelas, Felix Naumann: Knowledge Transfer for Entity Resolution with Siamese Neural Networks. Journal of Data and Information Quality (JDIQ) 13:(1), 2021
    [DOI:10.1145/3410157]


2020


  • Loredana Caruccio, Vincenzo Deufemia, Felix Naumann, Giuseppe Polese: Discovering Relaxed Functional Dependencies based on Multi-attribute Dominance. Transactions on Knowledge and Data Engineering (TKDE) (2020)
    [IEEE]  [DOI:10.1109/TKDE.2020.2967722]
  • Julian Risch, Nicolas Alder, Christoph Hewel, Ralf Krestel: PatentMatch: A Dataset for Matching Patent Claims with Prior Art. ArXiv (2020)
    [Paper]  [Project Page] 
  • Nitisha Jain, Christian Bartz, Tobias Bredow, Emanuel Metzenthin, Jona Otholt, Ralf Krestel: Semantic Analysis of Cultural Heritage Data: Aligning Paintings and Descriptions in Art-Historic Collections. Proceedings of the International Workshop on Fine Art Pattern Extraction and Recognition (FAPER@ICPR), 2020
    [Paper]  [Springer]  [DOI:10.1007/978-3-030-68796-0_37]
  • Julian Risch, Victor Künstler, Ralf Krestel: HyCoNN: Hybrid Cooperative Neural Networks for Personalized News Discussion Recommendation. Proceedings of the International Joint Conferences on Web Intelligence and Intelligent Agent Technologies (WI-IAT), 2020
    [Paper]  [GitHub] 
  • Nitisha Jain, Ralf Krestel: Learning Fine-Grained Semantics for Multi-Relational Data. Proceedings of the International Semantic Web Conference, Posters and Demos (ISWC), 2020
    [Paper]  [Poster] 
  • Mazhar Hameed, Felix Naumann: Data Preparation: A Survey of Commercial Tools. SIGMOD Record 49:(3), 2020
    [Paper] 
  • Eduardo H. M. Pena, Edson R. L. Filho, Eduardo C. de Almeida, Felix Naumann: Efficient Detection of Data Dependency Violations. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2020
    [Paper] 
  • Johann Birnick, Thomas Bläsius, Tobias Friedrich, Felix Naumann, Thorsten Papenbrock, Martin Schirneck: Hitting Set Enumeration with Partial Information for Unique Column Combination Discovery. PVLDB 13:(11), 2020
    [Paper]  [DOI:10.14778/3407790.3407824]
  • Julian Risch, Ralf Krestel: A Dataset of Journalists' Interactions with Their Readership: When Should Article Authors Reply to Reader Comments?. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2020
    [Paper]  [GitHub]  [DOI:10.1145/3340531.3412764]
  • Ali Ehteshami Bejnordi, Ralf Krestel: Dynamic Channel and Layer Gating in Convolutional Neural Networks. Proceedings of the German Conference on Artificial Intelligence (KI), 2020
    [Paper] 
  • Jan Ehmüller, Lasse Kohlmeyer, Holly McKee, Daniel Paeschke, Tim Repke, Ralf Krestel, Felix Naumann: Sense Tree: Discovery of New Word Senses with Graph-based Scoring. Lernen, Wissen, Daten, Analysen (LWDA), 2020
    [Paper]  [CEUR-WS]  [Project] 
  • Nitisha Jain: Multimodal Knowledge Graphs for Semantic Analysis of Cultural Heritage Data. Invited Talk at the Workshop on Knowledge Bases and Multiple Modalities (KBMM@AKBC), 2020
    [Paper] 
  • Philipp Schirmer, Thorsten Papenbrock, Ioannis Koumarelas, Felix Naumann: Efficient Discovery of Matching Dependencies. Transactions on Database Systems (TODS) 45:(3), 2020
    [Paper]  [DOI:10.1145/3392778]
  • Julian Risch, Robin Ruff, Ralf Krestel: Explaining Offensive Language Detection. Journal for Language Technology and Computational Linguistics (JLCL) 34:(1), 2020
    [Paper]  [GitHub] 
  • Konstantina Lazaridou, Alexander Löser, Maria Mestre, Felix Naumann: Discovering Biased News Articles Leveraging Multiple Human Annotations. Proceedings of the Conference on Language Resources and Evaluation (LREC), 2020
    [Paper]  [Paper] 
  • Julian Risch, Robin Ruff, Ralf Krestel: Offensive Language Detection Explained. Proceedings of the Workshop on Trolling, Aggression and Cyberbullying (TRAC@LREC), 2020
    [Paper]  [GitHub] 
  • Julian Risch, Samuele Garda, Ralf Krestel: Hierarchical Document Classification as a Sequence Generation Task. Proceedings of the Joint Conference on Digital Libraries (JCDL), 2020
    [Paper]  [GitHub] 
  • Sebastian Kruse, Zoi Kaoudi, Jorge-Arnulfo Quiane-Ruiz, Sanjay Chawla, Felix Naumann, Bertty Contreras-Rojas: RHEEMix in the Data Jungle: A Cost-based Optimizer for Cross-Platform Systems. The VLDB Journal 29:(6), 2020
    [URL] 
  • Julian Risch, Ralf Krestel: Bagging BERT Models for Robust Aggression Identification. Proceedings of the Workshop on Trolling, Aggression and Cyberbullying (TRAC@LREC), 2020
    [Paper]  [GitHub] 
  • Nitisha Jain: Domain-Specific Knowledge Graph Construction for Semantic Analysis. Proceedings of the Extended Semantic Web Conference (ESWC), 2020
    [Paper]  [URL]  [DOI:10.1007/978-3-030-62327-2_40]
  • Nitisha Jain, Christian Bartz, Ralf Krestel: Automatic Matching of Paintings and Descriptions in Art-Historic Archives using Multimodal Analysis. Proceedings of the International Workshop on Artificial Intelligence for Historical Image Enrichment and Access (AI4HI@LREC), 2020
    [Paper]  [URL] 
  • Julian Risch, Ralf Krestel: Top Comment or Flop Comment? Predicting and Explaining User Engagement in Online News Discussions. Proceedings of the International Conference on Web and Social Media (ICWSM), 2020
    [Paper]  [GitHub] 
  • Tim Repke, Ralf Krestel: Visualising Large Document Collections by Jointly Modeling Text and Network Structure. Proceedings of the Joint Conference on Digital Libraries (JCDL), 2020
    [Paper]  [Project]  [DOI:10.1145/3383583.3398524]
  • Tim Repke, Ralf Krestel: Exploration Interface for Jointly Visualised Text and Graph Data. Proceedings of the International Conference on Intelligent User Interfaces Companion (IUI), 2020
    [Paper]  [Project]  [DOI:10.1145/3379336.3381470]
  • Leon Bornemann, Tobias Bleifuß, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava: Natural Key Discovery in Wikipedia Tables. Proceedings of The World Wide Web Conference (WWW), 2020
    [Paper]  [DOI:10.1145/3366423.3380039]
  • Ioannis Koumarelas, Lan Jiang, Felix Naumann: Data Preparation for Duplicate Detection. Journal of Data and Information Quality (JDIQ) 12:(3), 2020
    [DOI:10.1145/3377878]
  • Philipp Hacker, Ralf Krestel, Stefan Grundmann, Felix Naumann: Explainable AI under Contract and Tort Law: Legal Incentives and Technical Challenges. Artificial Intelligence and Law (2020)
    [Paper] 
  • Ioannis Koumarelas, Thorsten Papenbrock, Felix Naumann: MDedup: Duplicate Detection with Matching Dependencies. PVLDB 13:(5), 2020
    [Paper] 
  • Lan Jiang, Felix Naumann: Holistic Primary Key and Foreign Key Detection. Journal of Intelligent Information Systems 54:(3), 2020
    [DOI:10.1007/s10844-019-00562-z]
  • Julian Risch, Ralf Krestel: Toxic Comment Detection in Online Discussions. Deep Learning-Based Approaches for Sentiment Analysis. Springer, 2020
    [Paper]  [DOI:10.1007/978-981-15-1216-2]


2019


  • Sebastian Schmidl, Frederic Schneider, Thorsten Papenbrock: An Actor Database System for Akka. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW) - Workshopband, 2019
    [Paper]  [DOI:10.18420/btw2019-ws-23]
  • Simon Razniewski, Nitisha Jain, Paramita Mirza, Gerhard Weikum: Coverage of Information Extraction from Sentences and Paragraphs. Proceedings of the Conference on Empirical Methods in Natural Language Processing and the International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019
    [Paper]  [ACL Web]  [DOI:10.18653/v1/D19-1583]
  • Eduardo H. M. Pena, Eduardo C. de Almeida, Felix Naumann: Discovery of Approximate (and Exact) Denial Constraints. PVLDB 13:(3), 2019
    [Paper] 
  • Julian Risch, Anke Stoll, Marc Ziegele, Ralf Krestel: hpiDEDIS at GermEval 2019: Offensive Language Identification using a German BERT model. Proceedings of the Conference on Natural Language Processing (KONVENS), 2019
    [Paper]  [GitHub] 
  • Lan Jiang, Gerardo Vitagliano, Felix Naumann: A Scoring-based Approach for Data Preparator Suggestion. Lernen, Wissen, Daten, Analysen (LWDA), 2019
    [Paper] 
  • Falco Dürsch, Axel Stebner, Fabian Windheuser, Maxi Fischer, Tim Friedrich, Nils Strelow, Tobias Bleifuß, Hazar Harmouch, Lan Jiang, Thorsten Papenbrock, Felix Naumann: Inclusion Dependency Discovery: An Experimental Evaluation of Thirteen Algorithms. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2019
    [Paper]  [Code]  [DOI:10.1145/3357384.3357916]
  • Uwe Draisbach, Peter Christen, Felix Naumann: Transforming Pairwise Duplicates to Entity Clusters for High Quality Duplicate Detection. Journal of Data and Information Quality (JDIQ) 12:(1), 2019
    [Paper] 
  • Nitisha Jain, Ralf Krestel: Who is Mona L.? Identifying Mentions of Artworks in Historical Archives. International Conference on Theory and Practice of Digital Libraries (TPDL), 2019
    [Paper]  [Springer]  [DOI:10.1007/978-3-030-30760-8_10]
  • Thomas Kellermeier, Tim Repke, Ralf Krestel: Mining Business Relationships from Stocks and News. Proceedings of the Workshop on Mining Data for Financial Applications (MIDAS@ECML-PKDD), 2019
    [Paper]  [DOI:10.1007/978-3-030-37720-5_6]
  • Philipp Schirmer, Thorsten Papenbrock, Sebastian Kruse, Felix Naumann, Dennis Hempfing, Torben Mayer, Daniel Neuschäfer-Rube: DynFD: Functional Dependency Discovery in Dynamic Datasets. Proceedings of the International Conference on Extending Database Technology (EDBT), 2019
    [Paper]  [DOI:10.5441/002/edbt.2019.23]
  • Julian Risch, Ralf Krestel: Measuring and Facilitating Data Repeatability in Web Science. Datenbank-Spektrum 19:(2), 2019
    [Paper]  [GitHub]  [DOI:10.1007/s13222-019-00316-9]
  • Julian Risch, Ralf Krestel: Domain-specific word embeddings for patent classification. Data Technologies and Applications 53:(1), 2019
    [Paper]  [Project Page]  [DOI:10.1108/DTA-01-2019-0002]
  • Felix Naumann: The relational database management systems genealogy. Making Databases Work. ACM / Morgan & Claypool, 2019
    [Paper] 
  • Sebastian Kruse, Zoi Kaoudi, Jorge-Arnulfo Quiané-Ruiz, Sanjay Chawla, Felix Naumann, Bertty Contreras-Rojas: Optimizing Cross-Platform Data Movement. Proceedings of the International Conference on Data Engineering (ICDE), 2019
    [Paper] 
  • Tobias Bleifuß, Leon Bornemann, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava: DBChEx: Interactive Exploration of Data and Schema Change. Proceedings of the Conference on Innovative Data Systems Research (CIDR), 2019
    [Paper]  [CIDRDB] 


2018


  • Michael Loster, Felix Naumann, Jan Ehmueller, Benjamin Feldmann: CurEx: A System for Extracting, Curating, and Exploring Domain-Specific Knowledge Graphs from Text. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2018
    [Paper]  [DOI:10.1145/3269206.3269229]
  • Michael Loster, Manuel Hegner, Felix Naumann, Ulf Leser: Dissecting Company Names using Sequence Labeling. Lernen, Wissen, Daten, Analysen (LWDA), 2018
    [Paper]  [Paper] 
  • Alberto Pietrangelo, Giovanni Simonini, Sonia Bergamaschi, Felix Naumann, Ioannis Koumarelas: Towards Progressive Search-driven Entity Resolution. Italian Symposium on Advanced Database Systems (SEBD), 2018
    [Paper]  [Paper] 
  • Ioannis Koumarelas, Axel Kroschk, Clifford Mosley, Felix Naumann: Experience: Enhancing Address Matching with Geocoding and Similarity Measure Selection. Journal of Data and Information Quality (JDIQ) 10:(2), 2018
    [Paper]  [DOI:10.1145/3232852]
  • Michael Loster, Tim Repke, Ralf Krestel, Felix Naumann, Jan Ehmueller, Benjamin Feldmann, Oliver Maspfuhl: The Challenges of Creating, Maintaining and Exploring Graphs of Financial Entities. Proceedings of the Fourth International Workshop on Data Science for Macro-Modeling (DSMM), 2018
    [Paper]  [DOI:10.1145/3220547.3220553]
  • Ziawasch Abedjan, Lukasz Golab, Felix Naumann, Thorsten Papenbrock: Data Profiling. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2018
    [M&C]  [DOI:10.2200/S00878ED1V01Y201810DTM052]
  • Tobias Bleifuß, Leon Bornemann, Theodore Johnson, Dmitri V. Kalashnikov, Felix Naumann, Divesh Srivastava: Exploring Change - A New Dimension of Data Analytics. PVLDB 12:(2), 2018
    [Paper]  [PVLDB]  [DOI:10.14778/3282495.3282496]
  • Julian Risch, Samuele Garda, Ralf Krestel: Book Recommendation Beyond the Usual Suspects: Embedding Book Plots Together with Place and Time Information. Proceedings of the International Conference On Asia-Pacific Digital Libraries (ICADL), 2018
    [Paper]  [GitHub] 
  • Julian Risch, Eva Krebs, Alexander Löser, Alexander Riese, Ralf Krestel: Fine-Grained Classification of Offensive Language. Proceedings of GermEval (co-located with KONVENS), 2018
    [Paper] 
  • Julian Risch, Ralf Krestel: Learning Patent Speak: Investigating Domain-Specific Word Embeddings. Proceedings of the Thirteenth International Conference on Digital Information Management (ICDIM), 2018
    [Paper]  [Project Page] 
  • Betty van Aken, Julian Risch, Ralf Krestel, Alexander Löser: Challenges for Toxic Comment Classification: An In-Depth Error Analysis. Proceedings of the Workshop on Abusive Language Online (co-located with EMNLP), 2018
    [Paper] 
  • Tim Repke, Ralf Krestel, Jakob Edding, Moritz Hartmann, Jonas Hering, Dennis Kipping, Hendrik Schmidt, Nico Scordialo, Alexander Zenner: Beacon in the Dark: A System for Interactive Exploration of Large Email Corpora. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2018
    [Paper v1]  [Paper v2]  [Project]  [DOI:10.1145/3269206.3269231]
  • Divy Agrawal, Sanjay Chawla, Zoi Kaoudi, Sebastian Kruse, Jorge Arnulfo Quiané-Ruiz, Bertty Contreras-Rojas, Ahmed Elmagarmid, Yasser Idris, Ji Lucas, Essam Mansour, Mourad Ouzzani, Paolo Papotti, Nan Tang, Saravanan Thirumuruganathan, Anis Troudi: RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -. PVLDB 11:(11), 2018
    [Paper]  [DOI:10.14778/3236187.3236195]
  • Claudia Exeler, Maria Graber, Tino Junge, Stefan Ramson, Cathleen Ramson, Fabian Tschirschnitz, Felix Naumann: Piggyback Profiling: Enhancing Query Results with Metadata. Lernen, Wissen, Daten, Analysen (LWDA), 2018
    [Paper] 
  • Julian Risch, Ralf Krestel: Aggression Identification Using Deep Learning and Data Augmentation. Proceedings of the Workshop on Trolling, Aggression and Cyberbullying (TRAC@COLING), 2018
    [Paper]  [GitHub] 
  • Julian Risch, Ralf Krestel: Delete or not Delete? Semi-Automatic Comment Moderation for the Newsroom. Proceedings of the Workshop on Trolling, Aggression and Cyberbullying (TRAC@COLING), 2018
    [Paper] 
  • Leon Bornemann, Tobias Bleifuß, Dmitri Kalashnikov, Felix Naumann, Divesh Srivastava: Data Change Exploration using Time Series Clustering. Datenbank-Spektrum 18:(2), 2018
    [Paper]  [DOI:10.1007/s13222-018-0285-x]
  • Stefan Bunk, Ralf Krestel: WELDA: Enhancing Topic Models by Incorporating Local Word Contexts. Proceedings of the Joint Conference on Digital Libraries (JCDL), 2018
    [Paper] 
  • Carl Ambroselli, Julian Risch, Ralf Krestel, Andreas Loos: Prediction for the Newsroom: Which Articles Will Get the Most Comments?. Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2018
    [Paper]  [GitHub] 
  • Konstantina Lazaridou, Toni Gruetze, Felix Naumann: Where in the World Is Carmen Sandiego? Detecting Person Locations via Social Media Discussions. Proceedings of the ACM Conference on Web Science, 2018
    [Paper]  [URL] 
  • Sebastian Kruse, Felix Naumann: Efficient Discovery of Approximate Dependencies. PVLDB 11:(7), 2018
    [Paper] 
  • Julian Risch, Ralf Krestel: My Approach = Your Apparatus? Entropy-Based Topic Modeling on Multiple Domain-Specific Text Collections. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (JCDL), 2018
    [Paper]  [GitHub] 
  • Laure Berti-Equille, Hazar Harmouch, Felix Naumann, Noel Novelli, Saravanan Thirumuruganathan: Discovery of Genuine Functional Dependencies from Relational Data with Missing Values. PVLDB, 2018
    [Paper]  [Paper]  [DOI:10.14778/3204028.3204032]
  • Tim Repke, Ralf Krestel: Topic-aware Network Visualisation to Explore Large Email Corpora. International Workshop on Big Data Visual Exploration and Analytics (BigVis), 2018
    [Paper]  [Project] 
  • Shazia Sadiq, Tamraparni Dasu, Xin Luna Dong, Juliana Freire, Ihab F. Ilyas, Sebastian Link, Renée J. Miller, Felix Naumann, Xiaofang Zhou, Divesh Srivastava: Data Quality – The Role of Empiricism. SIGMOD Record 46:(4), 2018
    [Paper] 
  • Tim Repke, Ralf Krestel: Bringing Back Structure to Free Text Email Conversations with Recurrent Neural Networks. Proceedings of the European Conference on Information Retrieval (ECIR), 2018
    [Paper]  [Project]  [DOI:10.1007/978-3-319-76941-7_9]


2017


  • Hazar Harmouch, Felix Naumann: Cardinality Estimation: An Experimental Survey. PVLDB, 2017
    [Paper]  [Paper]  [DOI:10.1145/3164135.3164145]
  • Fabian Tschirschnitz, Thorsten Papenbrock, Felix Naumann: Detecting Inclusion Dependencies on Very Many Tables. Transactions on Database Systems (TODS) 42:(3), 2017
    [Paper]  [DOI:10.1145/3105959]
  • David Heller, Ralf Krestel, Uwe Ohler, Martin Vingron, Annalisa Marsico: ssHMM: Extracting Intuitive Sequence-Structure Motifs from High-Throughput RNA-Binding Protein Data. Nucleic Acid Research 45:(19), 2017
    [DOI:10.1093/nar/gkx756]
  • Tobias Bleifuß, Sebastian Kruse, Felix Naumann: Efficient Denial Constraint Discovery with Hydra. PVLDB 11:(3), 2017
    [Paper]  [PVLDB]  [DOI:10.14778/3157794.3157800]
  • M. Jürgen Giesler, Bettina Keller, Tim Repke, Rainer Leonhart, Joachim Weis, Rebecca Muckelbauer, Nina Rieckmann, Jacqueline Müller-Nordhorn, Gabriele Lucius-Hoene, Christine Holmberg: Effect of a Website That Presents Patients' Experiences on Self-Efficacy and Patient Competence of Colorectal Cancer Patients: Web-Based Randomized Controlled Trial. Journal of Medical Internet Research (JMIR) 19:(10), 2017
    [JMIR]  [DOI:10.2196/jmir.7639]
  • Konstantina Lazaridou, Ralf Krestel, Felix Naumann: Identifying Media Bias by Analyzing Reported Speech. Proceedings of the International Conference on Data Mining (ICDM), 2017
    [IEEE]  [DOI:https://ieeexplore.ieee.org/document/8215582]
  • Fabian Maschler, Fabio Niephaus, Julian Risch: Real or Fake? Large-Scale Validation of Identity Leaks. Jahrestagung der Gesellschaft für Informatik (INFORMATIK), 2017
    [Paper] 
  • Zhe Zuo, Michael Loster, Ralf Krestel, Felix Naumann: Uncovering Business Relationships: Context-sensitive Relationship Extraction for Difficult Relationship Types. Lernen, Wissen, Daten, Analysen (LWDA), 2017
    [Paper] 
  • Ralf Krestel, Julian Risch: How Do Search Engines Work? A Massive Open Online Course with 4000 Participants. Lernen, Wissen, Daten, Analysen (LWDA), 2017
    [Paper] 
  • Michael Loster, Zhe Zuo, Felix Naumann, Oliver Maspfuhl, Dirk Thomas: Improving Company Recognition from Unstructured Text by using Dictionaries. Proceedings of the International Conference on Extending Database Technology, 2017
    [Paper]  [DOI:10.5441/002/edbt.2017.82]
  • Julian Risch, Ralf Krestel: What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers. Proceedings of the International Conference on Theory and Practice of Digital Libraries (TPDL), 2017
    [Paper]  [GitHub] 
  • Tobias Bleifuß, Theodore Johnson, Dmitri V. Kalashnikov, Felix Naumann, Vladislav Shkapenyuk, Divesh Srivastava: Enabling Change Exploration (Vision). Proceedings of the Fourth International Workshop on Exploratory Search in Databases and the Web (ExploreDB), 2017
    [Paper]  [DOI:10.1145/3077331.3077340]
  • Sebastian Kruse, Thorsten Papenbrock, Christian Dullweber, Moritz Finke, Manuel Hegner, Martin Zabel, Christian Zöllner, Felix Naumann: Fast Approximate Discovery of Inclusion Dependencies. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW), 2017
    [Paper] 
  • Thorsten Papenbrock, Felix Naumann: A Hybrid Approach for Efficient Unique Column Combination Discovery. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW), 2017
    [Paper] 
  • Thorsten Papenbrock, Felix Naumann: Data-driven Schema Normalization. Proceedings of the International Conference on Extending Database Technology (EDBT), 2017
    [Paper]  [DOI:10.5441/002/edbt.2017.31]
  • Felix Naumann, Ralf Krestel: Das Fachgebiet „Informationssysteme“ am Hasso-Plattner-Institut. Datenbank-Spektrum 17:(1), 2017
    [Paper]  [URL] 
  • Toni Gruetze, Ralf Krestel, Konstantina Lazaridou, Felix Naumann: What was Hillary Clinton doing in Katy, Texas?. Proceedings of the International Conference on World Wide Web (WWW), 2017
    [Paper] 
  • Tim Repke, Michael Loster, Ralf Krestel: Comparing Features for Ranking Relationships Between Financial Entities Based on Text. Proceedings of the International Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets (DSMM), 2017
    [Paper]  [Poster]  [Slides]  [DOI:10.1145/3077240.3077252]
  • Ziawasch Abedjan, Lukasz Golab, Felix Naumann: Data Profiling (tutorial). Proceedings of the International Conference on Management of Data (SIGMOD), 2017
    [Paper] 


2016


  • Lan Jiang, Hengyang Lu, Ming Xu, Chongjun Wang: Biterm pseudo document topic model for short text. Proceedings of the International Conference on Tools with Artificial Intelligence (ICTAI), 2016
    [Paper]  [IEEE]  [DOI:10.1109/ICTAI.2016.0134]
  • Tim Repke: Extraction Of Citation Data From Websites Based On Visual Cues. , 2016
    [Thesis] 
  • Ahmad Samiei, Felix Naumann: Cluster-based Sorted Neighborhood for Efficient Duplicate Detection. International Conference on Data Mining Workshops (ICDMW), 2016
    [URL] 
  • Tobias Bleifuß, Susanne Bülow, Johannes Frohnhofen, Julian Risch, Georg Wiese, Sebastian Kruse, Thorsten Papenbrock, Felix Naumann: Approximate Discovery of Functional Dependencies for Large Datasets. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2016
    [Paper]  [DOI:10.1145/2983323.2983781]
  • Divy Agrawal, Lamine Ba, Laure Berti-Equille, Sanjay Chawla, Ahmed Elmagarmid, Hossam Hammady, Yasser Idris, Zoi Kaoudi, Zuhair Khayyat, Sebastian Kruse, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J. Zaki: Rheem: Enabling Multi-Platform Task Execution (demo). Proceedings of the ACM Conference on Management of Data (SIGMOD), 2016
    [Paper] 
  • Ahmad Samiei, Ioannis Koumarelas, Michael Loster, Felix Naumann: Combination of Rule-based and Textual Similarity Approaches to Match Financial Entities. Data Science for Macro-Modeling with Financial and Economic Datasets (DSMM), 2016
    [Paper]  [URL] 
  • Jens Ehrlich, Mandy Roick, Lukas Schulze, Jakob Zwiener, Thorsten Papenbrock, Felix Naumann: Holistic Data Profiling: Simultaneous Discovery of Various Metadata. Proceedings of the International Conference on Extending Database Technology (EDBT), 2016
    [Paper]  [Paper] 
  • Christian Godde, Konstantina Lazaridou, Ralf Krestel: Classification of German Newspaper Comments. Lernen, Wissen, Daten, Analysen (LWDA), 2016
    [Paper] 
  • Konstantina Lazaridou, Ralf Krestel: Identifying Political Bias in News Articles. International Conference on Theory and Practice of Digital Libraries. IEEE Technical Committee on Digital Libraries, 2016
    [Paper] 
  • Sebastian Kruse, Anja Jentzsch, Thorsten Papenbrock, Zoi Kaoudi, Jorge-Arnulfo Quiane-Ruiz, Felix Naumann: RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets. Proceedings of the International Conference on Management of Data (SIGMOD), 2016
    [Paper]  [DOI:10.1145/2882903.2915206]
  • Sebastian Kruse, Thorsten Papenbrock, Hazar Harmouch, Felix Naumann: Data Anamnesis: Admitting Raw Data into an Organization. Data Engineering Bulletin 39:(2), 2016
    [Paper] 
  • Thorsten Papenbrock, Felix Naumann: A Hybrid Approach to Functional Dependency Discovery. Proceedings of the International Conference on Management of Data (SIGMOD), 2016
    [Paper]  [DOI:10.1145/2882903.2915203]
  • Maximilian Grundke, Johannes Jasper, Mariya Perchyk, Jan Philipp Sachse, Ralf Krestel, Mariana Neves: TextAI: Enhancing TextAE with Intelligent Annotation Support. Proceedings of the International Symposium on Semantic Mining in Biomedicine (SMBM), 2016
    [Paper]  [DOI:10.1007/978-3-319-41754-7_18]
  • Jihyun Park, Margaret Blume-Kohout, Ralf Krestel, Eric Nalisnick, Padhraic Smyth: Analyzing NIH Funding Patterns over Time with Statistical Text Analysis. Scholarly Big Data: AI Perspectives, Challenges, and Ideas (SBD) Workshop at AAAI, 2016
    [Paper] 
  • Ralf Krestel, Davide Mottin, Emmanuel Müller: Proceedings of the Conference "Lernen, Wissen, Daten, Analysen", Potsdam, Germany, September 12-14, 2016. CEUR Workshop Proceedings. CEUR-WS.org, 2016
  • Maximilian Jenders, Ralf Krestel, Felix Naumann: Which Answer is Best? Predicting Accepted Answers in MOOC Forums. Proceedings of the International Conference Companion on World Wide Web, 2016
    [Paper] 
  • Toni Gruetze, Ralf Krestel, Felix Naumann: Topic Shifts in StackOverflow: Ask it like Socrates. Lecture Notes in Computer Science, 2016
    [Paper]  [DOI:10.1007/978-3-319-41754-7_18]
  • Felix Naumann, Ralf Krestel: The Information Systems Group at HPI. SIGMOD Record (2016)
    [Paper] 
  • Jennifer Engler, Sandra Adami, Yvonne Adam, Bettina Keller, Tim Repke, Hella Fügemann, Gabriele Lucius-Hoene, Jacqueline Müller-Nordhorn, Christine Holmberg: Using others’ experiences. Cancer patients’ expectations and navigation of a website providing narratives on prostate, breast and colorectal cancer. Patient Education and Counseling 99:(8), 2016
    [ScienceDirect]  [DOI:10.1016/j.pec.2016.03.015]
  • Toni Gruetze, Gjergji Kasneci, Zhe Zuo, Felix Naumann: CohEEL: Coherent and Efficient Named Entity Linking through Random Walks. Web Semantics: Science, Services and Agents on the World Wide Web 37:(C), 2016
    [Paper]  [DOI:10.1016/j.websem.2016.03.001]
  • Philipp Langer, Felix Naumann: Efficient Order Dependency Discovery. The VLDB Journal 25:(2), 2016
    [DOI:10.1007/s00778-015-0412-3]
  • Lukasz Golab Ziawasch Abedjan, Felix Naumann: Data Profiling (tutorial). Proceedings of the International Conference on Data Engineering (ICDE), 2016
    [Paper] 


2015


  • Patrick Hennig, Philipp Berger, Christian Dullweber, Moritz Finke, Fabian Maschler, Julian Risch, Christoph Meinel: Social Media Story Telling. Proceedings of the International Conference on Social Computing and Networking (SocialCom), 2015
    [Paper] 
  • Dominik Schmidt, Johannes Frohnhofen, Sven Knebel, Florian Meinel, Mariya Perchyk, Julian Risch, Jonathan Striebel, Julia Wachtel, Patrick Baudisch: Ergonomic Interaction for Touch Floors. Proceedings of the Conference on Human Factors in Computing Systems (CHI), 2015
    [Paper]  [DOI:10.1145/2702123.2702254]
  • Ralf Krestel, Thomas Werkmeister, Timur Pratama Wiradarma, Gjergji Kasneci: Tweet-Recommender: Finding Relevant Tweets for News Articles. Proceedings of the International World Wide Web Conference (WWW), 2015
    [Paper] 
  • Thorsten Papenbrock, Arvid Heise, Felix Naumann: Progressive Duplicate Detection. IEEE Transactions on Knowledge and Data Engineering (TKDE) 27:(5), 2015
    [Paper]  [DOI:10.1109/TKDE.2014.2359666]
  • Sebastian Kruse, Thorsten Papenbrock, Felix Naumann: Scaling Out the Discovery of Inclusion Dependencies. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW), 2015
    [Paper] 
  • Thorsten Papenbrock, Sebastian Kruse, Jorge-Arnulfo Quiane-Ruiz, Felix Naumann: Divide & Conquer-based Inclusion Dependency Discovery. PVLDB 8:(7), 2015
    [Paper]  [DOI:10.14778/2824032.2824086]
  • Thorsten Papenbrock, Tanja Bergmann, Moritz Finke, Jakob Zwiener, Felix Naumann: Data Profiling with Metanome. PVLDB 8:(12), 2015
    [Paper]  [DOI:10.14778/2824032.2824086]
  • Thorsten Papenbrock, Jens Ehrlich, Jannik Marten, Tommy Neubert, Jan-Peer Rudolph, Martin Schönberg, Jakob Zwiener, Felix Naumann: Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms. PVLDB 8:(10), 2015
    [Paper]  [DOI:10.14778/2794367.2794377]
  • Ralf Krestel, Nima Dokoohaki: Diversifying Customer Review Rankings. Neural Networks (2015)
    [DOI:10.1016/j.neunet.2015.02.008]
  • Tobias Schubotz, Ralf Krestel: Online Temporal Summarization of News Events. Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2015
    [Paper] 
  • Toni Gruetze, Gary Yao, Ralf Krestel: Learning Temporal Tagging Behaviour. Proceedings of the International Conference on World Wide Web Companion (WWW), 2015
    [Paper]  [DOI:10.1145/2740908.2741701]
  • Mandy Roick, Maximilian Jenders, Ralf Krestel: How to Stay Up-to-date on Twitter with General Keywords. Proceedings of the LWA Workshops: KDML, FGWM, IR, and FGDB, 2015
    [Paper] 
  • Maximilian Jenders, Thorben Lindhauer, Gjergji Kasneci, Ralf Krestel, Felix Naumann: A Serendipity Model For News Recommendation. KI: Advances in Artificial Intelligence - Annual German Conference on AI, 2015
    [Paper] 
  • Ziawasch Abedjan, Lukasz Golab, Felix Naumann: Profiling relational data: a survey. The VLDB Journal 24:(4), 2015
    [Paper]  [DOI:10.1007/s00778-015-0389-y]
  • Anja Jentzsch, Hannes Mühleisen, Felix Naumann: Uniqueness, Density, and Keyness: Exploring Class Hierarchies. In Proceedings of International Workshop on Consuming Linked Data (COLD), ISWC, 2015
    [Paper] 
  • Anja Jentzsch, Christian Dullweber, Pierpaolo Troiano, Felix Naumann: Exploring Linked Data Graph Structures. Proceedings of the International Semantic Web Conference, Posters and Demos (ISWC), 2015
    [Paper] 
  • Astrid Rheinländer, Arvid Heise, Fabian Hueske, Ulf Leser, Felix Naumann: SOFA: An Extensible Logical Optimizer for UDF-heavy Data Flows. Information Systems (2015)
  • Sebastian Kruse, Paolo Papotti, Felix Naumann: Estimating Data Integration and Cleaning Effort. Proceedings of the International Conference on Extending Database Technology (EDBT), 2015
    [Paper] 


2014


  • Jun Yang, Lan Jiang, Chongjun Wang, Junyuan Xie: Multi-label emotion classification for tweets in weibo: Method and application. Proceedings of the International Conference on Tools with Artificial Intelligence (ICTAI), 2014
    [IEEE]  [DOI:10.1109/ICTAI.2014.71]
  • Astrid Rheinländer, Martin Beckmann, Anja Kunkel, Arvid Heise, Thomas Stoltmann, Ulf Leser: Versatile optimization of UDF-heavy data flows with SOFA (demo). Proceedings of the International Conference on Management of Data (SIGMOD), 2014
    [Paper]  [DOI:10.1145/2588555.2594517]
  • Alexander Alexandrov, Rico Bergmann, Stephan Ewen, Johann-Christoph Freytag, Fabian Hueske, Arvid Heise, Odej Kao, Marcus Leich, Ulf Leser, Volker Markl, Felix Naumann, Mathias Peters, Astrid Rheinländer, Matthias J. Sax, Sebastian Schelter, Mareike Höger, Kostas Tzoumas, Daniel Warneke: The Stratosphere Platform for Big Data Analytics. The VLDB Journal 23:(6), 2014
    [Paper] 
  • Benedikt Forchhammer, Anja Jentzsch, Felix Naumann: LODOP - Multi-Query Optimization for Linked Data Profiling Queries. Proceedings of the Extended Semantic Web Conference (ESWC), 2014
    [Paper] 
  • Ralf Krestel, Sabine Bergler, René Witte: Modeling human newspaper readers: The Fuzzy Believer approach. Natural Language Engineering 20:(2), 2014
    [Paper]  [DOI:10.1017/S1351324912000289]
  • Ziawasch Abedjan, Jorge-Arnulfo Quanie-Ruiz, Felix Naumann: Detecting Unique Column Combinations on Dynamic Data. Proceedings of the International Conference on Data Engineering (ICDE), 2014
    [Paper] 
  • Andreas Meyer, Luise Pufahl, Kimon Batoulis, Sebastian Kruse, Thorben Lindhauer, Thomas Stoff, Dirk Fahland, Mathias Weske: Data Perspective in Process Choreographies: Modeling and Execution. International Conference on Advanced Information Systems Engineering, 2014
  • Philipp Langer, Patrick Schulze, Stefan George, Matthias Kohnen, Tobias Metzke, Ziawasch Abedjan, Gjergji Kasneci: Assigning Global Relevance Scores to DBpedia Facts. International Workshop on Data Engineering meets the Semantic Web (DESWeb), 2014
    [Paper] 
  • Toni Gruetze, Gjergji Kasneci, Zhe Zuo, Felix Naumann: Bootstrapping Wikipedia to Answer Ambiguous Person Name Queries. International Workshop on Information Integration on the Web (IIWeb), 2014
    [Paper] 
  • Ziawasch Abedjan, Patrick Schulze, Felix Naumann: DFD: Efficient Discovery of Functional Dependencies. In Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2014
    [Paper] 
  • Ziawasch Abedjan, Toni Gruetze, Anja Jentzsch, Felix Naumann: Profiling and Mining RDF Data with ProLOD++. Proceedings of the International Conference on Data Engineering (ICDE), 2014
    [Paper] 
  • Johannes Lorey: Identifying and Determining SPARQL Endpoint Characteristics. International Journal of Web Information Systems 10:(3), 2014
  • Tobias Vogel, Felix Naumann: Semi-Supervised Consensus Clustering: Reducing Human Effort. Proceedings of the International Workshop on Data Integration and Applications, 2014
    [Paper] 
  • Jens Lehmann, Robert Isele, Max Jakob, Anja Jentzsch, Dimitris Kontokostas, Pablo N. Mendes, Sebastian Hellmann, Mohamed Morsey, Patrick van Kleef, Sören Auer, Christian Bizer: DBpedia – A Large-scale, Multilingual Knowledge Base Extracted from Wikipedia. Semantic Web Journal (2014)
  • Zhe Zuo, Gjergji Kasneci, Toni Gruetze, Felix Naumann: BEL: Bagging for Entity Linking. 25th International Conference on Computational Linguistics (COLING), 2014
    [Paper] 
  • Arvid Heise, Gjergji Kasneci, Felix Naumann: Estimating the Number and Sizes of Fuzzy-Duplicate Clusters. Proceedings of the Conference on Information and Knowledge Management (CIKM), 2014
    [Paper] 
  • Ziawasch Abedjan, Felix Naumann: Amending RDF Entities with New Facts. Proceedings of the Extended Semantic Web Conference (ESWC), 2014
    [Paper] 
  • Tobias Vogel, Arvid Heise, Uwe Draisbach, Dustin Lange, Felix Naumann: Reach for Gold: An Annealing Standard to Evaluate Duplicate Detection Results. Journal of Data and Information Quality (JDIQ) 5:(1-2), 2014
    [Paper] 


2013


  • Johannes Lorey: Storing and Provisioning Linked Data as a Service. Proceedings of the Extended Semantic Web Conference (ESWC), 2013
    [Paper] 
  • Ziawasch Abedjan, Felix Naumann: Improving RDF Data through Association Rule Mining. Datenbank-Spektrum (Special Issue on RDF Data Management) 13:(2), 2013
    [Paper] 
  • Johannes Lorey, Felix Naumann: Detecting SPARQL Query Templates for Data Prefetching. Proceedings of the Extended Semantic Web Conference (ESWC), 2013
    [Paper] 
  • Johannes Lorey, Felix Naumann: Caching and Prefetching Strategies for SPARQL Queries. Proceedings of the Extended Semantic Web Conference (ESWC), 2013
    [Paper] 
  • Maximilian Jenders, Gjergji Kasneci, Felix Naumann: Analyzing and Predicting Viral Tweets. Proceedings of the International World Wide Web Conference (WWW), 2013
    [Paper] 
  • Marcus Leich, Jochen Adamek, Moritz Schubotz, Arvid Heise, Astrid Rheinlander, Volker Markl: Applying Stratosphere for Big Data Analytics. Database Systems for Business, Technology, and Web (BTW), 2013
    [Paper] 
  • Saeedeh Momtazi, Felix Naumann: Topic modeling for expert finding using latent dirichlet allocation. WIREs Data Mining and Knowledge Discovery 3:(5), 2013
    [Paper] 
  • Ziawasch Abedjan, Felix Naumann: Synonym Analysis for Predicate Expansion. Proceedings of the Extended Semantic Web Conference (ESWC), 2013
    [Paper] 
  • Johannes Lorey: SPARQL Endpoint Metrics for Quality-Aware Linked Data Consumption. Proceedings of the International Conference on Information Integration and Web-based Applications & Services (iiWAS), 2013
    [Paper] 
  • Daniel Rinser, Dustin Lange, Felix Naumann: Cross-lingual Entity Matching and Infobox Alignment in Wikipedia. Information Systems (IS) 38:(6), 2013
    [Paper] 
  • Felix Naumann, Maximilian Jenders, Thorsten Papenbrock: Ein Datenbankkurs mit 6000 Teilnehmern - Erfahrungen auf der openHPI MOOC Plattform. Informatik-Spektrum 37:(12), 2013
    [Paper]  [DOI:10.1007/s00287-013-0750-8]
  • Benedikt Forchhammer, Thorsten Papenbrock, Thomas Stening, Sven Viehmeier, Uwe Draisbach, Felix Naumann: Duplicate Detection on GPUs. Proceedings of the conference on Database Systems for Business, Technology, and Web (BTW), 2013
    [Paper] 
  • Simon Lacoste-Julien, Konstantina Palla, Alex Davies, Gjergji Kasneci, Thore Graepel, Zoubin Ghahramani: SiGMa: Simple Greedy Matching for Aligning Large Knowledge Bases. Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2013
  • Arvid Heise, Jorge-Arnulfo Quiane-Ruiz, Ziawasch Abedjan, Anja Jentzsch, Felix Naumann: Scalable Discovery of Unique Column Combinations. PVLDB, 2013
    [Paper] 
  • Johannes Lorey, Felix Naumann: Caching and Prefetching Strategies for SPARQL Queries. Proceedings of the International Workshop on Usage Analysis and the Web of Data (USEWOD), 2013
    [Paper] 
  • Dustin Lange, Felix Naumann: Cost-Aware Query Planning for Similarity Search. Information Systems (IS) 38:(4), 2013
    [Paper] 
  • Dustin Lange, Felix Naumann: Bulk Sorted Access for Efficient Top-k Retrieval. Proceedings of the International Conference on Scientific and Statistical Database Management (SSDBM), 2013
    [Paper] 
  • Alexander Albrecht, Felix Naumann: Systematic ETL Management – Experiences with High-Level Operators. Proceedings of the International Conference on Information Quality (ICIQ), 2013
    [Paper] 
  • Astrid Rheinländer, Arvid Heise, Fabian Hueske, Ulf Leser, Felix Naumann: SOFA: An Extensible Logical Optimizer for UDF-heavy Dataflows. , 2013
    [] 
  • Uwe Draisbach, Felix Naumann: On Choosing Thresholds for Duplicate Detection. Proceedings of the International Conference on Information Quality (ICIQ), 2013
    [Paper] 
  • Felix Naumann: Data Profiling Revisited. SIGMOD Record 32:(4), 2013
    [Paper] 


2012


  • Dandy Fenz, Dustin Lange, Astrid Rheinländer, Felix Naumann, Ulf Leser: Efficient Similarity Search in Very Large String Sets. Proceedings of the International Conference on Scientific and Statistical DatabaseManagement (SSDBM), 2012
    [Paper] 
  • Alexander Albrecht, Felix Naumann: Schema Decryption for Large Extract-Transform-Load Systems. Proceedings of the International Conference on Conceptual Modeling (ER), 2012
    [Paper] 
  • Arvid Heise, Felix Naumann: Integrating Open Government Data with Stratosphere for more Transparency. Web Semantics: Science, Services and Agents on the World Wide Web 14:(1), 2012
    [Paper]  [DOI:10.1016/j.websem.2012.02.002]
  • George Beskales, Gautam Das, Ahmed K. Elmagarmid, Ihab F. Ilyas, Felix Naumann, Mourad Ouzzani, Paolo Papotti, Jorge Quiane-Ruiz, Nan Tang: The Data Analytics Group at the Qatar Computing Research Institute. SIGMOD Record 41:(4), 2012
  • Tobias Vogel, Felix Naumann: Automatic Blocking Key Selection for Duplicate Detection based on Unigram Combinations. Proceedings of the International Workshop on Quality in Databases (QDB) in conjunction with VLDB, 2012
    [Paper] 
  • Martin Köppelmann, Dustin Lange, Claudia Lehmann, Marika Marszalkowski, Felix Naumann, Peter Retzlaff, Sebastian Stange, Lea Voget: Scalable Similarity Search with Dynamic Similarity Measures. Proceedings of the International Workshop on Ranking in Databases (DBRank) in conjunction with VLDB, 2012
    [Paper] 
  • Melanie Herschel, Felix Naumann, Sascha Szott, Maik Taubert: Scalable Iterative Graph Duplicate Detection. Transactions on Knowledge and Data Engineering (TKDE) 24:(11), 2012
  • Christoph Böhm, Gjergji Kasneci, Felix Naumann: Latent Topics in Graph-Structured Data. Proceedings of the Conference on Information and Knowledge Management (CIKM), 2012
    [Paper] 
  • Jana Bauckmann, Ziawasch Abedjan, Heiko Müller, Ulf Leser, Felix Naumann: Discovering Conditional Inclusion Dependencies. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2012
  • Alexander Albrecht, Felix Naumann: Understanding Cryptic Schemata in Large Extract-Transform-Load Systems. Hasso-Plattner-Institut für Softwaresystemtechnik an der Universität Potsdam, 2012
  • Saeedeh Momtazi: Fine-grained German Sentiment Analysis on Social Media. Proceedings of the International Conference on Language Resources and Evaluation (LREC), 2012
  • Alberto Abelló, Jérôme Darmont, Lorena Etcheverry, Matteo Golfarelli, Jose-Norberto Mazón, Felix Naumann, Torben Bach Pedersen, Stefano Rizzi, Juan Trujillo, Panos Vassiliadis, Gottfried Vossen: Fusion Cubes: Towards Self-Service Business Intelligence. International Journal of Data Warehousing and Mining (IJDWM) 9:(2), 2012
    [DOI:10.4018/jdwm.2013040104]
  • Toni Gruetze, Christoph Böhm, Felix Naumann: Holistic and Scalable Ontology Alignment for Linked Open Data. Proceedings of the Linked Data on the Web (LDOW) Workshop at the International World Wide Web Conference (WWW), 2012
    [Paper] 
  • Enkelejda Tafaj, Gjergji Kasneci, Wolfgang Rosenstiel, Martin Bogdan: Bayesian online clustering of eye movement data. Proceedings of the Symposium on Eye-Tracking Research and Applications, 2012
    [Paper]  [DOI:10.1145/2168556.2168617]
  • Uwe Draisbach, Felix Naumann, Sascha Szott, Oliver Wonneberg: Adaptive Windows for Duplicate Detection. Proceedings of the International Conference on Data Engineering (ICDE), 2012
    [Paper] 
  • Christoph Böhm, Markus Freitag, Arvid Heise, Claudia Lehmann, Andrina Mascher, Felix Naumann, Mauricio Hernandez, Vuk Ercegovac, Peter Haase: GovWILD: Integrating Open Government Data for Transparency (demo). Proceedings of the International World Wide Web Conference (WWW), 2012
  • Ziawasch Abedjan, Johannes Lorey, Felix Naumann: Reconciling Ontologies and the Web of Data. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2012
  • Jana Bauckmann, Ziawasch Abedjan, Ulf Leser, Heiko Müller, Felix Naumann: Covering or complete? : discovering conditional inclusion dependencies. Hasso-Plattner-Institut für Softwaresystemtechnik an der Universität Potsdam, 2012
  • Christoph Böhm, Gerard de Melo, Felix Naumann, Gerhard Weikum: LINDA: Distributed Web-of-Data-Scale Entity Matching. Proceedings of the International Conference on Information and Knowledge Management (CIKM), Maui, Hawaii, 2012
  • Uwe Draisbach: Partitionierung zur effizienten Duplikaterkennung in relationalen Daten. Ausgezeichnete Arbeiten zur Informationsqualität. Springer Vieweg, 2012
  • Christoph Böhm, Daniel Hefenbrock, Felix Naumann: Scalable Peer-to-Peer-based RDF Management. Proceedings of the Int. Conference on Semantic Systems, 2012
    [Paper] 
  • Gjergji Kasneci: Reasoning about Knowledge from the Web - (Extended Abstract). ICWE Workshops, 2012
    [Paper]  [DOI:10.1007/978-3-642-35623-0_19]
  • Arvid Heise, Astrid Rheinländer, Marcus Leich, Ulf Leser, Felix Naumann: Meteor/Sopremo: An Extensible Query Language and Operator Model. Proceedings of the International Workshop on End-to-end Management of Big Data (BigData) in conjunction with VLDB, 2012
    [Paper] 
  • Uwe Draisbach, Felix Naumann: Adaptive Windows for Duplicate Detection. Hasso-Plattner-Institut für Softwaresystemtechnik an der Universität Potsdam, 2012
    [Paper] 


2011


  • Ziawasch Abedjan, Felix Naumann: Advancing the Discovery of Unique Column Combinations. Proceedings of the International Conference on Information and Knowledge Management (CIKM), 2011
    [Paper] 
  • Johannes Lorey, Ziawasch Abedjan, Felix Naumann, Christoph Böhm: RDF Ontology (Re-)Engineering through Large-scale Data Mining. Billion Triples Challenge (BTC) at the International Semantic Web Conference (ISWC), 2011
    [Paper] 
  • Johannes Lorey, Felix Naumann, Benedikt Forchhammer, Andrina Mascher, Peter Retzlaff, Armin ZamaniFarahani, Soeren Discher, Cindy Faehnrich, Stefan Lemme, Thorsten Papenbrock, Robert Christoph Peschel, Stephan Richter, Thomas Stening, Sven Viehmeier: Black Swan: Augmenting Statistics with Event Data. Proceedings of the Conference on Information and Knowledge Management (CIKM), 2011
    [Paper] 
  • Tobias Vogel, Felix Naumann: Instance-based one-to-some Assignment of Similarity Measures to Attributes. Proceedings of the International Conference on Cooperative Information Systems (CoopIS), 2011
    [Paper] 
  • Dustin Lange, Tobias Vogel, Uwe Draisbach, Felix Naumann: Projektseminar "Similarity Search Algorithms". Datenbank-Spektrum 11:(1), 2011
    [Paper] 
  • Christoph Böhm, Eyk Kny, Benjamin Emde, Ziawasch Abedjan, Felix Naumann: SPRINT: ranking search results by paths. Proceedings of the International Conference on Extending Database Technology (EDBT), 2011
    [URL] 
  • Ziawasch Abedjan, Felix Naumann: Advancing the Discovery of Unique Column Combinations. Hasso-Plattner-Institut für Softwaresystemtechnik an der Universität Potsdam, 2011
  • Dustin Lange, Felix Naumann: Frequency-aware Similarity Measures. Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 2011
    [Paper] 
  • Ziawasch Abedjan, Felix Naumann: Context and Target Configurations for Mining RDF Data. International Workshop on Search & Mining Entity-Relationship Data (SMER), 2011
  • Uwe Draisbach, Felix Naumann: A Generalization of Blocking and Windowing Algorithms for Duplicate Detection. Proceedings of the International Conference on Data and Knowledge Engineering (ICDKE), 2011
    [Paper] 
  • Dustin Lange, Felix Naumann: Efficient Similarity Search: Arbitrary Similarity Measures, Arbitrary Composition. Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 2011
    [Paper] 
  • Jens Bleiholder, Felix Naumann: Kurz erklärt: Datenfusion. Datenbank-Spektrum 11:(1), 2011
  • Jens Bleiholder, Melanie Herschel, Felix Naumann: Eliminating NULLs with Subsumption and Complementation. Data Engineering Bulletin 34:(3), 2011
  • Mohammed AbuJarour, Felix Naumann: Improving Service Discovery through Enriched Service Descriptions. Datenbanksysteme für Business, Technologie und Web (BTW), 2011
  • Christoph Böhm, Johannes Lorey, Felix Naumann: Creating voiD Descriptions for Web-scale Data. Journal of Web Semantics: Science, Services and Agents on the World Wide Web 9:(3), 2011
    [Paper]  [DOI:10.1016/j.websem.2011.06.001]


2010


  • Christoph Böhm, Felix Naumann, Ziawasch Abedjan, Dandy Fenz, Toni Gruetze, Daniel Hefenbrock, Matthias Pohl, David Sonnabend: Profiling linked open data with ProLOD. Proceedings of the International Conference on Data Engineering (ICDE), 2010
    [Paper] 
  • Jana Bauckmann, Ulf Leser, Felix Naumann: Efficient and Exact Computation of Inclusion Dependencies for Data Integration. Hasso-Plattner-Institut für Softwaresystemtechnik an der Universität Potsdam, 2010
    [Paper] 
  • Dustin Lange, Christoph Böhm, Felix Naumann: Extracting structured information from Wikipedia articles to populate infoboxes. Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 2010
    [Paper] 
  • Felix Naumann, Melanie Herschel: An Introduction to Duplicate Detection. Synthesis Lectures on Data Management. Morgan & Claypool Publishers, 2010
  • Mohammed AbuJarour, Felix Naumann: Dynamic tags for dynamic data web services. Proceedings of the Workshop on Emerging Web Services Technology (WEWST), 2010
  • Xin Luna Dong, Felix Naumann: Proceedings of the 13th International Conference on Extending Database Technology (EDBT), Lausanne, Switzerland. ACM International Conference Proceeding Series. ACM, 2010
  • Uwe Draisbach, Felix Naumann: DuDe: The Duplicate Detection Toolkit. Proceedings of the International Workshop on Quality in Databases (QDB), 2010
    [Paper] 
  • Johannes Lorey, Felix Naumann: Towards Granular Data Placement Strategies for Cloud Platforms. Proceedings of the International Conference on Granular Computing (GrC), 2010
    [Paper] 
  • Mohammed AbuJarour, Felix Naumann: Towards a diamond SOA operational model. IEEE International Conference on Service-Oriented Computing and Applications (SOCA), 2010
  • Xin Luna Dong, Felix Naumann: 13th International Workshop on the Web and Databases: WebDB 2010 (workshop report). SIGMOD Record 39:(3), 2010
  • Xin Luna Dong, Felix Naumann: Proceedings of the 13th International Workshop on the Web and Databases (WebDB), Indianapolis, IN. ACM, 2010
  • Dustin Lange, Christoph Böhm, Felix Naumann: Extracting structured information from Wikipedia articles to populate infoboxes. Hasso-Plattner-Institut für Softwaresystemtechnik an der Universität Potsdam, 2010
    [Paper] 
  • Mohammed AbuJarour, Felix Naumann, Mircea Craculeac: Collecting, Annotating, and Classifying Public Web Services. On the Move to Meaningful Internet Systems: OTM - Confederated International Conferences: CoopIS, IS, DOA and ODBASE, 2010
  • Christoph Böhm, Felix Naumann, Markus Freitag, Stefan George, Norman Höfler, Martin Köppelmann, Claudia Lehmann, Andrina Mascher, Tobias Schmidt: Linking open government data: what journalists wish they had known. Proceedings the International Conference on Semantic Systems (I-SEMANTICS), Graz, Austria, 2010
    [URL] 
  • Christoph Böhm, Johannes Lorey, Dandy Fenz, Eyk Kny, Matthias Pohl, Felix Naumann: Creating voiD Descriptions for Web-Scale Data. Billion Triples Challenge (BTC) at the International Semantic Web Conference (ISWC), 2010
    [Paper] 
  • Jens Bleiholder, Sascha Szott, Melanie Herschel, Felix Naumann: Complement union for data integration. Proceedings of the International Conference on Data Engineering (ICDE), 2010
    [Paper] 
  • Falk Brauer, Michael Huber, Gregor Hackenbroich, Ulf Leser, Felix Naumann, Wojciech M. Barczynski: Graph-based concept identification and disambiguation for enterprise search. Proceedings of the International Conference on World Wide Web (WWW), 2010
  • Tobias Vogel: Self-Adaptive Data Quality Web Services. Grundlagen von Datenbanken, 2010
    [Paper] 
  • Jens Bleiholder, Sascha Szott, Melanie Herschel, Frank Kaufer, Felix Naumann: Subsumption and complementation as data fusion operators. Proceedings of the International Conference on Extending Database Technology (EDBT), 2010


2009


  • Christoph Böhm, Philip Groth, Ulf Leser: Graph-Based Ontology Construction from Heterogeneous Evidences. Proceedings of the International Semantic Web Conference (ISWC), 2009
  • Xin Luna Dong, Felix Naumann: Data fusion - Resolving Data Conflicts for Integration (tutorial). PVLDB 2:(2), 2009
  • Alexandra Rostin, Oliver Albrecht, Jana Bauckmann, Felix Naumann, Ulf Leser: A Machine Learning Approach to Foreign Key Discovery. Proceedings of the International Workshop on the Web and Databases (WebDB), 2009
    [Paper] 
  • Uwe Draisbach, Felix Naumann: A Comparison and Generalization of Blocking and Windowing Algorithms for Duplicate Detection. Proceedings of the International Workshop on Quality in Databases (QDB), 2009
    [Paper] 
  • Mohammed AbuJarour, Mircea Craculeac, Falko Menge, Tobias Vogel, Jan-Felix Schwarz: POSR: A Comprehensive System for Aggregating and Using Web Services (demo). Proceedings of the IEEE Services Cup at IEEE International Conference on Web Services (ICWS), 2009
    [Paper] 
  • Tobias Vogel, Frank Kaufer, Felix Naumann: Encapsulating Multi-stepped Web Forms as Web Services. Proceedings of the International Conference on Service-Oriented Computing (ICSOC), 2009
    [Paper] 
  • Alexander Albrecht, Felix Naumann: METL: Managing and Integrating ETL Processes. Proceedings of the VLDB PhD Workshop, 2009
  • Felix Naumann, Louiqa Raschid: Guest Editorial for the Special Issue on Data Quality in Databases. Journal of Data and Information Quality (JDIQ) 1:(2), 2009


2008


  • Jens Bleiholder, Felix Naumann: Data fusion. ACM Computing Surveys 41:(1), 2008
  • Melanie Weis, Felix Naumann, Ulrich Jehle, Jens Lufter, Holger Schuster: Industry-scale duplicate detection. PVLDB 1:(2), 2008
    [Paper] 
  • Melanie Herschel, Felix Naumann: Scaling up duplicate detection in graph data. Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 2008
    [Paper] 
  • Alexander Albrecht, Felix Naumann: Managing ETL Processes. Proceedings of the International Workshop on New Trends in Information Integration, (NTII), Auckland, New Zealand, 2008
  • Katja Hose, Armin Roth, Andre Zeitz, Kai-Uwe Sattler, Felix Naumann: A research agenda for query processing in large-scale peer data management systems. Information Systems (IS) 33:(7-8), 2008
  • Matthias Jacob, Alexander Kuscher, Christoph Thiele, Max Plauth: Automated data augmentation services using text mining, data cleansing and web crawling techniques. IEEE Congress on Services, 2008


2007


  • Jana Bauckmann, Ulf Leser, Felix Naumann, Veronique Tietz: Efficiently Detecting Inclusion Dependencies. Proceedings of the International Conference on Data Engineering (ICDE), 2007
    [Paper] 
  • Felix Naumann: Schema- und Metadatenmanagement in Peer Data Management Systemen. Datenbanksysteme in Business, Technologie und Web (BTW), Workshop Proceedings, 2007
    [Paper] 
  • Frank Legler, Felix Naumann: A Classification of Schema Mappings and Analysis of Mapping Tools. Proceedings of Datenbanksysteme in Business, Technologie und Web (BTW), 2007
    [Paper] 
  • Jens Bleiholder, Karsten Draba, Felix Naumann: FuSem - Exploring Different Semantics of Data Fusion (demo). Proceedings of the International Conference on Very Large Data Bases (VLDB), 2007
    [Paper] 
  • Armin Roth, Felix Naumann: System P: Completeness-driven Query Answering in Peer Data Management Systems (demo). Datenbanksysteme in Business, Technologie und Web (BTW), 2007
    [Paper] 
  • Felix Naumann: Datenqualität. Informatik-Spektrum 30:(1), 2007
    [Paper] 
  • Paul Führing, Felix Naumann: Emergent Data Quality Annotation And Visualization. Proceedings of the International Conference on Information Quality (ICIQ), 2007
    [Paper] 
  • Jochen Hipp, Markus Müller, Johannes Hohendorff, Felix Naumann: Rule-Based Measurement Of Data Quality In Nominal Data. Proceedings of the International Conference on Information Quality (ICIQ), 2007
    [Paper] 
  • Louiqa Raschid, Maria Esther Vidal, Yao Wu, Felix Naumann, Jens Bleiholder: Answering Top K Queries Efficiently with Overlap of Answers in Sources or Source Paths. Proceedings of the International Workshop on Information Integration on the Web (IIWeb), 2007
    [Paper] 
  • Felix Naumann, Armin Roth: Peer-Daten-Management-Systeme - PDMS. Datenbank-Spektrum (2007)
    [Paper] 
  • Ganti Venkatesh, Felix Naumann: Proceedings of the 5th International Workshop on Quality in Databases (QDB). , 2007
  • Alexander Albrecht, Felix Naumann: Networked PIM using PDMS. Proceedings of the International Workshop Networking Meets Databases (NetDB), 2007
    [Paper] 


2006


  • Jens Bleiholder, Felix Naumann: Conflict Handling Strategies in an Integrated Information System. Proceedings of the International Workshop on Information Integration on the Web (IIWeb), 2006
    [Paper] 
  • Jens Bleiholder, Samir Khuller, Felix Naumann, Louiqa Raschid, Yao Wu: Query Planning in the Presence of Overlapping Sources. Proceedings of the International Conference on Extending Database Technology (EDBT), 2006
    [Paper] 
  • Sven Puhlmann, Melanie Weis, Felix Naumann: XML Duplicate Detection Using Sorted Neighborhoods. Proceedings of the International Conference on Extending Database Technology (EDBT), 2006
    [Paper] 
  • Jit Biswas, Felix Naumann, Qiang Qiu: Assessing the Completeness of Sensor Data. Proceedings of the International Conference on Database Systems for Advanced Applications (DASFAA), 2006
    [Paper] 
  • Felix Naumann, Alexander Bilke, Jens Bleiholder, Melanie Weis: Data Fusion in Three Steps: Resolving Schema, Tuple, and Value Inconsistencies. Data Engineering Bulletin 29:(2), 2006
    [Paper] 
  • Jan Hegewald, Felix Naumann, Melanie Weis: XStruct: Efficient Schema Extraction from Multiple and Large XML Documents. Proceedings of the International Conference on Data Engineering (ICDE), 2006
    [Paper] 
  • Jana Bauckmann, Ulf Leser, Felix Naumann: Efficiently Computing Inclusion Dependencies for Schema Discovery. Proceedings of the International Conference on Data Engineering (ICDE), 2006
    [Paper] 
  • Ulf Leser, Felix Naumann, Barbara Eckmann: Proceedings of the Data Integration in the Life Sciences Workshop (DILS). Lecture Notes in Computer Science. Springer, 2006
  • Armin Roth, Felix Naumann, Tobias Hübner, Martin Schweigert: System P: Query Answering in PDMS under Limited Resources. Proceedings of the International Workshop on Information Integration on the Web (IIWeb), 2006
    [Paper] 
  • Ulf Leser, Felix Naumann: Informationsintegration: Architekturen und Methoden zur Integration verteilter und heterogener Datenquellen. dpunkt, 2006
    [Paper] 
  • Melanie Weis, Felix Naumann: Detecting Duplicates in Complex XML Data. Proceedings of the International Conference on Data Engineering (ICDE), 2006
    [Paper] 
  • Felix Naumann, Mary Roth: Information Quality: How Good are Off-the-Shelf DBMS?. Information Quality Management: Theory and Applications. Idea Group Inc., 2006


2005


  • Ulf Leser, Felix Naumann: (Almost) Hands-Off Information Integration for the Life Sciences. Proceedings of the International Conference on Innovative Database Research (CIDR), 2005
    [Paper] 
  • Ralf Heese, Sven Herschel, Felix Naumann, Armin Roth: Self-Extending Peer Data Management. Datenbanksysteme in Business, Technologie und Web (BTW), Karlsruhe, Germany, 2005
    [Paper] 
  • Stephan Heymann, Felix Naumann, Peter Rieger, Louiqa Raschid: Enhancing the Semantics of Links and Paths in Life Science Sources. ICDT Workshop on Database Issues in Biological Databases (DBiBD), 2005
    [Paper] 
  • Jens Bleiholder, Felix Naumann: Declarative Data Fusion - Syntax, Semantics, and Implementation. Proceedings of the International Conference on Advances in Databases and Information Systems (ADBIS), 2005
    [Paper] 
  • : Proceedings of the 2005 International Conference on Information Quality (MIT IQ Conference), Sponsored by Lockheed Martin, MIT, Cambridge, MA, USA, November 10-12, 2006. MIT, 2005
  • Michael Mielke, Heiko Müller, Felix Naumann: Ein Data-Quality-Wettbewerb. Datenbank-Spektrum (2005)
    [Paper] 
  • George A. Mihaila, Felix Naumann, Louiqa Raschid, Maria-Esther Vidal: A Data Model and Query Language to Explore Enhanced Links and Paths in Life Science Sources. Proceedings of the International Workshop on the Web & Databases (WebDB), 2005
    [Paper] 
  • Melanie Weis, Felix Naumann: DogmatiX Tracks down Duplicates in XML. Proceedings of the ACM International Conference on Management of Data (SIGMOD), 2005
    [Paper] 
  • Armin Roth, Felix Naumann: Benefit and Cost of Query Answering in PDMS. Proceedings of the Databases, Information Systems, and Peer-to-Peer Computing Workshop (DBISP2P) Seoul, Korea, 2005
    [Paper] 
  • Melanie Weis: Fuzzy Duplicate Detection on XML Data. Proceedings of the VLDB PhD workshop, 2005
    [Paper] 
  • Alexander Bilke, Felix Naumann: Schema Matching using Duplicates. Proceedings of the International Conference on Data Engineering (ICDE), 2005
    [Paper] 
  • Alexander Bilke, Jens Bleiholder, Christoph Böhm, Karsten Draba, Felix Naumann, Melanie Weis: Automatic Data Fusion with HumMer (demo). Proceedings of the International Conference on Very Large Data Bases (VLDB), 2005
    [Paper] 
  • Melanie Weis, Felix Naumann, Franziska Brosy: A Duplicate Detection Benchmark for XML (and Relational) Data. Proceedings of the SIGMOD International Workshop on Information Quality for Information Systems (IQIS), 2005
    [Paper] 
  • Hagen Höpfner, Gunter Saaske, Felix Naumann, Andreas Heuer: Beitragsband zum Studierenden-Programm bei der 11. Fachtagung "Datenbanken für Business, Technologie and Web", GI Fachbereich Datenbanken und Informationssysteme, Karlsruhe. Universität Magdeburg, Fakultät für Informatik, 2005
  • Mauricio A. Hernández, Lucian Popa, Howard Ho, Felix Naumann: Clio: A Schema Mapping Tool for Information Integration. Proceedings of the International Symposium on Parallel Architectures, Algorithms, and Networks (ISPAN), 2005


2004


  • Felix Naumann, Mary Roth: Information Quality: How Good Are Off-The-Shelf DBMS?. Proceedings of the International Conference on Information Quality (ICIQ), Cambridge, MA, 2004
    [Paper] 
  • Felix Naumann, Monica Scannapieco: Proceedings of the International Workshop on Information Quality in Information Systems (SIGMOD Workshop). ACM, 2004
  • Stephan Heymann, Felix Naumann, Louiqa Raschid, Peter Rieger: Labeling and Enhancing Life Sciences Links. Proceedings of the International IEEE Computer Society Computational Systems Bioinformatics Conference (CSB), 2004
    [Paper] 
  • Felix Naumann, Jens Bleiholder, Melanie Weis: Eine Übung zur Vorlesung Informationsintegration. Datenbank-Spektrum (2004)
    [Paper] 
  • Felix Naumann: Informationsintegration. Öffentliche Vorlesung an der Humboldt-Universität zu Berlin, 2004
  • Armin Roth, Felix Naumann: Qualitäts- und Semantik-gesteuerte Anfragebearbeitung für Peer-basierte Datenmanagementsysteme (PDMS). INFORMATIK - Band 1, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI), Ulm, Germany, 2004
    [Paper] 
  • Jens Bleiholder, Felix Naumann, Louiqa Raschid, Maria Esther Vidal: Querying Web-Accessible Life Science Sources: Which paths to choose?. Proceedings of the International Workshop on Information Integration on the Web (IIWeb), 2004
    [Paper] 
  • Zoé Lacroix, Hyma Murthy, Felix Naumann, Louiqa Raschid: Links and Paths through Life Sciences Data Sources. Humboldt-Universität zu Berlin, Institut für Informatik, 2004
    [Paper] 
  • Jens Bleiholder, Zoé Lacroix, Hyma Murthy, Felix Naumann, Louiqa Raschid, Maria-Esther Vidal: BioFast: Challenges in Exploring Linked Life Science Sources. SIGMOD Record 33:(2), 2004
    [Paper] 
  • Zoé Lacroix, Hyma Murthy, Felix Naumann, Louiqa Raschid: Links and Paths through Life Sciences Data Sources. Proceedings of the International WorkshopData Integration in the Life Sciences (DILS), 2004
    [Paper] 
  • Jens Bleiholder, Felix Naumann: FUSE BY: Syntax und Semantik zur Informationsfusion in SQL. INFORMATIK, Band 1, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI), 2004
    [Paper] 
  • Melanie Weis, Felix Naumann: Detecting Duplicate Objects in XML Documents. International Workshop on Information Quality in Information Systems (IQIS), 2004
    [Paper] 
  • Felix Naumann, Johann Christoph Freytag, Ulf Leser: Completeness of integrated information sources. Information Systems (IS) 29:(7), 2004


2003


  • Zoé Lacroix, Felix Naumann, Louiqa Raschid, Maria-Esther Vidal: Exploring Life Sciences Data Sources. Proceedings of Workshop on Information Integration on the Web (IIWeb), 2003
    [Paper] 
  • Felix Naumann, Cinzia Capiello, Vipul Kashyap, Gunter Saake: Information Quality Assessment and Measurement. Data Quality on the Web, 2003
  • Vanja Josifovski, Sabine Massmann, Felix Naumann: Super-Fast XML Wrapper Generation in DB2: A Demonstration. Proceedings of the International Conference on Data Engineering (ICDE), 2003
    [Paper] 
  • Mattis Neiling, Steffen Jurk, Hans-J. Lenz, Felix Naumann: Object Identification Quality. Proceedings of the International Workshop on Data Quality in Cooperative Information Systsems (DQCIS), 2003
    [Paper] 
  • Alexander Löser, Felix Naumann, Wolf Siberski, Wolfgang Nejdl, Uwe Thaden: Semantic Overlay Clusters within Super-Peer Networks. First International Workshop on Databases, Information Systems, and Peer-to-Peer Computing (DBISP2P), 2003
    [Paper] 
  • Heiko Müller, Felix Naumann: Data Quality in Genome Databases. Proceedings of the International Conference on Information Quality (ICIQ), 2003
    [Paper] 
  • Felix Naumann: Qualitätsgesteuerte Anfragebearbeitung für Integrierte Informationssysteme. it - Information Technology 45:(1), 2003
    [Paper] 
  • Felix Naumann, Johann-Christoph Freytag, Ulf Leser: Completeness of Information Sources. Proceedings of the International Workshop on Data Quality in Cooperative Information Systsems (DQCIS), 2003
    [Paper] 


2002


  • Periklis Andritsos, Ronald Fagin, Ariel Fuxman, Laura M. Haas, Mauricio A. Hernández, C. T. Howard Ho, Anastasios Kementsietsidis, Renée J. Miller, Felix Naumann, Lucian Popa, Yannis Velegrakis, Charlotte Vilarem, Ling-Ling Yan: Schema Management. Data Engineering Bulletin 25:(3), 2002
    [Paper] 
  • Felix Naumann, Matthias Häussler: Declarative Data Merging with Conflict Resolution. Proceedings of the International Conference on Information Quality (ICIQ), 2002
    [Paper] 
  • Felix Naumann: Quality-Driven Query Answering for Integrated Information Systems. Lecture Notes in Computer Science. Springer, 2002
  • Mauricio A. Hernández, Lucian Popa, Yannis Velegrakis, Renée J. Miller, Felix Naumann, Ching-Tien Ho: Mapping XML and Relational Schemas with Clio (demo). Proceedings of the International Conference on Data Engineering (ICDE), 2002
    [Paper] 
  • Barbara Eckman, Mauricio Hernandez, Howard Ho, Felix Naumann, Lucian Popa: Schema Mapping and Data Integration with Clio (demo). Intelligent Systems for Molecular Biology (ISMB), 2002
    [Paper] 
  • Felix Naumann, Ching-Tien Ho, Xuqing Tian, Laura M. Haas, Nimrod Megiddo: Attribute Classification Using Feature Analysis. Proceedings of the International Conference on Data Engineering (ICDE), 2002
    [Paper] 
  • Felix Naumann, Ching-Tien Ho, Xuqing Tian, Laura Haas, Nimrod Megiddo: Attribute Classification Using Feature Analysis. IBM Almaden Research Center, 2002
    [Paper] 


2001


  • Felix Naumann: From Databases to Information Systems - Information Quality Makes the Difference. Proceedings of the International Conference on Information Quality (ICIQ), 2001
    [Paper] 


2000


  • Torsten Schlieder, Felix Naumann: Approximate Tree Embedding for Querying XML Data. Proceedings of the ACM SIGIR Workshop on XML and Information Retrieval, 2000
    [Paper] 
  • Felix Naumann, Claudia Rolker: Assessment Methods for Information Quality Criteria. Proceedings of the International Conference on Information Quality (ICIQ), 2000
    [Paper] 
  • Felix Naumann, Johann-Christoph Freytag: Completeness of Information Sources. Humboldt-Universität zu Berlin, Institut für Informatik, 2000
    [Paper] 
  • Felix Naumann, Claudia Rolker: Assessment Methods for Information Quality Criteria. Humboldt-Universität zu Berlin, Institut für Informatik, 2000
    [Paper] 
  • Ramana Yerneni, Felix Naumann, Hector Garcia-Molina: Maximizing Coverage of Mediated Web Queries. Stanford University, CA, 2000
    [Paper] 
  • Felix Naumann: Quality-driven Query Planning. Proceedings of the EDBT PhD Workshop, 2000
  • Felix Naumann, Ulf Leser: Cooperative Query Answering with Density Scores. Proceedings of the International Conference on Management of Data (COMAD), 2000
    [Paper] 
  • Ulf Leser, Felix Naumann: Query Planning with Information Quality Bounds. Proceedings of the International Conference on Flexible Query Answering Systems (FQAS), 2000
    [Paper] 


1999


  • Felix Naumann, Ulf Leser, Johann Christoph Freytag: Quality-driven Integration of Heterogeneous Information Systems. Proceedings of International Conference on Very Large Data Bases (VLDB), 1999
    [Paper] 
  • Felix Naumann, Ulf Leser, Johann-Christoph Freytag: Quality-driven Integration of Heterogeneous Information Systems. Humboldt-Universität zu Berlin, Institut für Informatik, 1999
    [Paper] 
  • Felix Naumann, Ulf Leser: Density Scores for Cooperative Query Answering. Workshop on Föderierte Datenbanken (FDBMS), 1999
    [Paper] 
  • Felix Naumann, Claudia Rolker: Do Metadata Models meet IQ Requirements?. Proceedings of the International Conference on Information Quality (ICIQ), 1999
    [Paper] 


1998


  • Felix Naumann, Johann Christoph Freytag, Myra Spiliopoulou: Quality Driven Source Selection Using Data Envelopment Analysis. Proceedings of the International Conference on Information Quality (ICIQ), 1998
    [Paper] 
  • Felix Naumann: Data Fusion and Data Quality. Proceedings of the New Techniques & Technologies for Statistics Seminar (NTTS), 1998
    [Paper]