Prof. Dr. Felix Naumann


Two papers accepted at JCDL 2018

Stefan Bunk, Ralf Krestel, Julian Risch The results of the Master's theses by Stefan Bunk and Julian Risch have been … > more


Full paper accepted at VLDB 2018

Sebastian Kruse and Felix Naumann Our paper "Efficient Discovery of Approximate Dependencies", which introduces the Pyro … > more


Data quality article appears in SIGMOD Record

Shazia Sadiq, Juliana Freire, Renée J. Miller, Tamraparni Dasu, Ihab F. Ilyas, Felix Naumann, Divesh Srivastava, … > more


Paper accepted at BigVis@EDBT 2018

Tim Repke and Ralf Krestel Our paper "Topic-aware Network Visualisation to Explore Large Email Corpora" has been accepted … > more



New project page and video available for Metacrate. Click the link to get more information about that open … > more


Dr. Thorsten Papenbrock

Thorsten Papenbrock has successfully defended his Ph.D. dissertation on December 19th, 2017! His work focused on the topic … > more


Paper accepted at ECIR 2018

Tim Repke and Ralf Krestel Our paper "Bringing Back Structure to Free Text Email Conversations with Recurrent Neural … > more


Article published in prestigious Nucleic Acids Research journal

We are very proud that our work on "Extracting Intuitive Sequence-Structure Motifs from High-Throughput RNA-Binding Protein … > more


Metanome Algorithms released on GitHub

The Metanome research group has just released most of their data profiling algorithms for the Metanome platform on GitHub … > more


Experimental survey accpted at VLDB 2018

Hazar Harmouch and Felix Naumann Our paper "Cardinality Estimation: An Experimental Survey" has been accepted for … > more


Paper accepted at VLDB 2018

Tobias Bleifuß, Sebastian Kruse and Felix Naumann Our paper "Efficient Denial Constraint Discovery with Hydra" has been … > more


Short paper accepted at ICDM 2017

Konstantina Lazaridou, Ralf Krestel, Felix Naumann Our paper "Identifying Media Bias by Analyzing Reported Speech" is … > more


Best Short Paper Award received at TPDL2017

Our submission “What Should I Cite? Cross-Collection Reference Recommendation for Patents and Papers” by Julian Risch and … > more


Two Papers accepted at LWDA 2017

Zhe Zuo, Michael Loster, Felix Naumann, Ralf Krestel, Julian Risch  Our two papers "Uncovering Business … > more


Demo accepted at CIKM 2017

Sebastian Kruse, David Hahn, Marius Walter, Felix Naumann Our demo proposal for Metacrate has been accepted for … > more


Paper accepted at TPDL 2017

Julian Risch, Ralf Krestel Our paper "What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers" … > more


Paper accepted at DSMM 2017 workshop

Tim Repke, Michael Loster, Ralf Krestel Our paper "Comparing Features for Ranking Relationships Between Financial Entities … > more


Vision paper accepted at ExploreDB 2017

Tobias Bleifuß, Theodore Johnson, Dmitri V. Kalashnikov, Felix Naumann, Vladislav Shkapenyuk and Divesh Srivastava Our … > more


Spark Summit 2017 session on Rheem

Rheem has been selected for presentation at the Spark Summit 2017. Located in the Bay Area, the Spark Summit is with over … > more


SIGMOD 2017 Tutorial about Data Profiling

Our tutorial "Data Profiling" will be held at the 2017 SIGMOD conference in Chicago. It is an evolved version of our 2016 … > more


Poster accepted at WWW 2017

The paper titled 'What was Hillary Clinton doing in Katy, Texas?' by Toni Gruetze, Konstantina Lazaridou, Ralf … > more


VLDB 2017 - Call for Papers in Industry track

Call for papers VLDB 2017 Industrial, Applications, and Experience Track August 28 to September 1, 2017 in Munich, … > more


Two papers on data profiling accepted at BTW 2017

Two papers on data profiling have been accepted at the BTW 2017. Both papers describe novel methods to discover different … > more


Metanome Projekt gewinnt Ideenwettbewerb des 10. Nationalen IT-Gipfels

Mit dem Projekt "Metanome - Die Data Profiling Plattform" gewannen Thorsten Papenbrock, Sebastian Kruse, Hazar Harmouch und … > more


Paper accepted at ICDM DINA workshop 2016

The paper "Cluster-based Sorted Neighborhood for Efficient Duplicate Detection" by Ahmad Samiei and Felix … > more


FG-DB Herbsttagung am HPI (german)

Vom 12. September bis zum 14. September findet am HPI die LWDA Konferenz statt, eine Kombination von Tagungen der vier GI … > more


Tutorial on Graph Exploration at CIKM 2016

Davide Mottin, Anja Jentzsch, and Emmanuel Müller will present the tutorial "Graph Exploration: Taking the user into the … > more


Paper accepted at CIKM 2016

The paper "Approximate Discovery of Functional Dependencies for Large Datasets" has been selected for presentation at the … > more


Tutorial on Rheem at BOSS@VLDB 2016

After a public voting phase, Rheem has been chosen for a tutorial at the workshop for Big Data Open Source … > more


Paper accepted at TPDL Doctoral Consortium 2016

The paper titled 'Identifying Political Bias in News Articles' by Konstantina Lazaridou and Dr. Ralf Krestel has been … > more


Article published in the Data Engineering Bulletin

The quarterly published IEEE Data Engineering Bulletin journal was just released. Its current issue contains articles that … > more


Rheem goes open source

The Rheem project has now published its source code under the Apache License on GitHub. Rheem is a cross-platform data … > more


Paper accepted at NLDB 2016

The paper titled 'Topic Shifts in StackOverflow: Ask it like Socrates' by Toni Gruetze, Ralf Krestel, and Felix Naumann has … > more


Article accepted for Journal of Web Semantics - SI: Knowledge Graphs

The article "CohEEL: Coherent and Efficient Named Entity Linking through Random Walks" by Toni Gruetze (HPI), Gjergji … > more


ACM SIGMOD Blog hosts Felix Naumann

Data optimization already touches many aspects of our lives promising a better, improved, even optimized, world. However, … > more


Two papers on data profiling accepted at SIGMOD 2016

The two papers "A Hybrid Approach to Functional Dependency Discovery" and "RDFind: Finding Conditional Inclusion … > more


Demo accepted at SIGMOD 2016

The Rheem project has been selected for a demo presentation at the SIGMOD 2016 (abstract at the bottom). The submission is … > more


Paper accepted at Q4APS WWW workshop

A full paper has been accepted at the Q4APS workshop at the WWW 2016 conference. The paper is called 'Which Answer is Best? … > more


Metanome version 1.0 released

In the last few months, we introduced many new features in the Metanome data profiling tool: We incorporated several new … > more


Student Paper accepted at EDBT 2016

The article "Holistic Data Profiling: Simultaneous Discovery of Various Metadata" by Jens Ehrlich, Mandy Roick, Lukas … > more


Order dependency detection article accepted for VLDB Journal

The article "Efficient Order Dependency Detection" by Philipp Langer (now IBM) and Felix Naumann (HPI) was accepted for … > more


Felix Naumann wins teaching prize

After being nominated by the student body of HPI, the faculty for mathematics and natural sciences of the University of … > more


Paper accepted at Web Intelligence 2015

The results of the Master thesis by Tobias Schubotz are being presented at the Web Intelligence conference in Singapore in … > more


Paper accepted at LWA 2015

The paper titled 'How to Stay Up-to-date on Twitter with General Keywords' by Mandy Roick, Maximilian Jenders, and … > more


Demo and Paper accepted at ISWC 2015 and ISWC 2015 Workshop

Demo at ISWC 2015  Exploring Linked Data Graph Structures Anja Jentzsch, Christian Dullweber, Pierpaolo Troiano, … > more


Paper accepted at KI 2015

Full Paper accepted at KI 2015: A Serendipity Model For News Recommendation Maximilian Jenders, Thorben Lindhauer, … > more


Markus Freitag wins TDWI Award for master's thesis

The former master's student Markus Freitag has won the prestigious TDWI award for the best master's thesis in the area of … > more


Demo accepted for VLDB 2015

The demonstration paper "Data Profiling with Metanome" was accepted for the 2015 VLDB conference. The authors are … > more


Survey on Data Profiling published in VLDB Journal

The article "Profiling relational data: a survey" by Ziawasch Abedjan (MIT), Lukasz Golab (University of Waterloo) and … > more


German news article about R

Dr. Ralf Krestel about the programming language R and its use for predictive analytics (in German):  … > more


Metanome presented at Sapphire 2015

The data profiling framework Metanome is presented at SAP's Sapphire 2015 conference in Orlando. See here for details and … > more


Second paper accepted at VLDB 2015

Experiments and Analysis Paper Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms Thorsten … > more


Dr. Arvid Heise

Arvid Heise has successfully defended his Ph.D. dissertation on March 17, 2015! His work focused on the topic "Data … > more


Poster accepted at WWW 2015

Research Poster Paper Tweet-Recommender: Finding Relevant Tweets for News Articles Ralf Krestel and Thomas Werkmeister … > more


Paper accepted at TempWeb 2015

Research Paper Learning Temporal Tagging Behaviour Toni Gruetze, Gary Yao, and Ralf Krestel Abstract. Social … > more


Ziawasch Abedjan wins dissertation award

Dr. Ziawasch Abedjan graduated from HPI in June 2014. His dissertation with the title "Improving RDF Data with Data Mining" … > more


Paper accepted at VLDB 2015

Research Paper Divide&Conquer-based Inclusion Dependency Discovery Thorsten Papenbrock, Sebastian Kruse, … > more


Apache Flink is a top-level project

After eight months in the incubating phase, the Apache Software Foundation board unanimously passed the resolution to … > more


Paper accepted at BTW 2015

Research Paper Scaling out the Discovery of Inclusion Dependencies Sebastian Kruse, Thorsten Papenbrock, Felix … > more


Paper accepted at EDBT 2015

Research Paper Estimating Data Integration and Cleaning Effort Sebastian Kruse, Paolo Papotti, Felix Naumann  … > more


Dr. Alexander Albrecht

Alexander Albrecht has successfully defended his Ph.D. dissertation on November 26, 2014! His work focused on the topic … > more


CIKM 2014 Best Student Paper Award

Our submission "DFD: Efficient Functional Dependency Discovery" by Ziawasch Abedjan, Patrick Schulze, and Felix Naumann to … > more


Paper accepted at DINA

1st International Workshop on Data Integration and Applications co-located with the IEEE International Conference on Data … > more


Dr. Johannes Lorey defended his Ph.D. dissertation

Johannes Lorey has successfully defended his Ph.D. dissertation on October 27, 2014! His work focused on the … > more


Anja Jentzsch wins Semantic Web Journal 2014 Outstanding Paper Award

Together with her co-authors Jens Lehmann, Robert Isele, Max Jakob, Dimitris Kontokostas, Pablo N. Mendes, Sebastian … > more


Internet-Wachstum: Datenweb seit 2011 mehr als verdreifacht

Das „Web der Daten“ hat sich seit Herbst 2011 mehr als verdreifacht. Das ist das Ergebnis einer Analyse, die … > more


Journal article accepted at TKDE

Progressive Duplicate Detection Thorsten Papenbrock and Arvid Heise and Felix Naumann Abstract. Duplicate detection is … > more


2 full papers accepted at CIKM

Estimating the Number and Sizes of Fuzzy-Duplicate Clusters Arvid Heise, Gjergji Kasneci, and Felix Naumann Abstract. … > more


Ziawasch Abedjan defended his Ph.D. dissertation

Ziawasch Abedjan has successfully defended his Ph.D. dissertation with distinction on July 18, 2014! His work focused on … > more


Paper accepted at COLING

25th International Conference on Computational Linguistics (COLING) August 23, 2014, Dublin, Ireland  … > more


Know@LOD Paper selected for "Best of Workshop" Session

Our paper "Ziawasch Abedjan and Felix Naumann. Amending RDF Entities with New Facts" from KNOW@LOD 2014 workshop … > more


Stratosphere accepted as Apache Incubator Project

We are happy to announce that Stratosphere has been accepted as a project for the Apache Incubator. The proposal has been … > more


2 Papers accepted at ESWC Workshops.

Know@LOD 2014 and PROFILES 2014, co-located with 10th Extended Semantic Web Conference (ESWC) 2014  … > more


Stratosphere overview paper accepted for VLDB Journal

The Stratosphere Platform for Big Data Analytics Alexander Alexandrov, Rico Bergmann, Stephan Ewen, Johann-Christoph … > more


SIGMOD Demo accepted

Versatile optimization of UDF-heavy data flows with Sofa Astrid Rheinländer, Martin Beckmann, Anja Kunkel, Arvid Heise, … > more


Paper accepted at DINA

Research Paper Bootstrapping Wikipedia to Answer Ambiguous Person Name Queries Toni Gruetze, Gjergji Kasneci, … > more


Paper accepted at DESWeb

5th International Workshop on Data Engineering meets the Semantic Web (DESWeb) In conjunction with ICDE 2014, Chicago … > more


DFG research unit "Stratosphere" extended

Joint research on Stratosphere by TU Berlin, HU Berlin, and HPI > more


Article accepted for Informatik-Spektrum

Ein Datenbankkurs mit 6.000 Teilnehmern: Erfahrungen auf der openHPI MOOC Plattform > more


Research Paper and Demo accepted for ICDE 2014

30th IEEE International Conference on Data Engineering (ICDE), Chicago, IL, USA, March 31st - April 4th, 2014  … > more


Paper accepted at VLDB 2014

40th International Conference on Very Large Data Bases (VLDB), Hangzhou, China, 1st - 5th September 2014 Scalable … > more


Paper accepted at iiWAS 2013

15th International Conference on Information Integration and Web-based Applications & Services  … > more


Dr. Christoph Böhm

Christoph Böhm has successfully defended his Ph.D. dissertation on September 13, 2013. > more


2 Papers at ICIQ - International Conference on Information Quality

Systematic ETL Management – Experiences with high-level Operators by Alexander Albrecht and Felix Naumann and On … > more


Database Genealogy - V4 released

We have just released the latest version of our RDBMS Genealogy showing a timeline of many popular relational database … > more


Dr. Jana Bauckmann

Jana Bauckmann has successfully defended her Ph.D. dissertation on June 14, 2013. > more


Dr. Dustin Lange

Dustin Lange successfully defends his PhD thesis "Effective and Efficient Similarity Search in Databases".  … > more


Datenbank-Spektrum Article Accepted

Special Issue on RDF Data Management (German Database Forum) > more


Paper accepted at SSDBM 2013

25th International Conference on Scientific and Statistical Database Management (SSDBM), July 29-31, 2013, Baltimore, … > more


Data Profiling Revisited: Article accepted for SIGMOD Record

Felix Naumann. Data Profiling Revisited. SIGMOD Record (to appear), 2013. Data profiling comprises a … > more


Paper accepted at MSND workshop @ WWW 2013

Analyzing and Predicting Viral Tweets Maximilian Jenders, Gjergji Kasneci, and Felix Naumann Abstract. Twitter and … > more


Runner Up for Best Paper Award at BTW 2013

The submission "Duplicate Detection on GPUs" by Benedikt Forchhammer, Thorsten Papenbrock, Thomas Stening, Sven … > more


Contributions to ESWC 2013

10th Extended Semantic Web Conference in Montpellier, France > more


Article accepted at Information Systems Journal (IS)

Cost-Aware Query Planning for Similarity Search Dustin Lange and Felix Naumann Abstract. Similarity search aims to … > more


Paper and demo accepted at BTW Conference

15th BTW conference on "Database Systems for Business, Technology, and Web" (BTW 2013) Magdeburg, Germany  … > more


Felix Naumann gives keynote talk at ICIQ 2012

On November 17 Felix Naumann talked about "The Quality of Web Data" at the 2012 International Conference on … > more


Article accepted at Information Systems Journal (IS)

Cross-lingual Entity Matching and Infobox Alignment in Wikipedia Daniel Rinser, Dustin Lange, and Felix Naumann  … > more


bibDuDe deduplicates BibTeX files

A tool to deduplicate scientific references > more


Article accepted at Int. Journal of Data Warehousing and Mining (IJDWM)

Fusion Cubes: Towards Self-Service Business Intelligence Alberto Abelló, Jérôme Darmont, Lorena … > more


Felix Naumann gives keynote talk at ICWE 2012

On July 26 Felix Naumann talked about "Extreme Web Data Integration" at the 2012 International Conference on Web … > more


3 Papers (short) accepted at CIKM 2012

21st ACM International Conference on Information and Knowledge Management (CIKM) will be held from October 29 to November … > more


3 Papers accepted at VLDB Workshops

DBRank 2012 – 6th International Workshop on Ranking in Databases, in conjunction with VLDB 2012 Scalable … > more


Paper accepted at I-Semantics Conference

I-SEMANTICS 2012 – 8th Int. Conference on Semantic Systems, Graz, Austria Scalable Peer-to-Peer-based RDF Management  … > more


Paper accepted at ER Conference

31st International Conference on Conceptual Modeling (ER 2012) - Florence, Italy Schema Decryption for Large … > more


Paper accepted at SSDBM

Proceedings of the 24th International Conference on Scientific and Statistical Database Management, 25-27 June … > more


Contributions to WWW 2012

Demo and LDOW paper accepted > more


JWS Article Accepted

Integrating Open Government Data with Stratosphere for more Transparency Arvid Heise and Felix Naumann Abstract. … > more


LREC Paper Accepted

The eighth international conference on Language Resources and Evaluation (LREC), Istanbul, Turkey. "Fine-grained … > more


Daniel Rinser wins award for his masters thesis

IQ Best Master Degree Wettbewerb der Deutschen Gesellschaft für Informations- und Datenqualität e. V. (DGIQ)  … > more


HPI TV releases video about GovWILD

See the new video about our Government Data Integration platform GovWILD. > more


Dr. Mohammed AbuJarour

"Enriched Service Descriptions: Sources, Approaches, and Usages" > more


Tool voidGen released

As part of our winning submission at the 2010 Billion Triple Challenge at the International Semantic Web Conference, we … > more


ICDE Paper Accepted

28th IEEE International Conference on Data Engineering (ICDE) Washington, DC, USA Adaptive Windows for Duplicate … > more


GovWILD in LOD cloud

The GovWILD team is happy to announce that the latest version of the LOD cloud (September 2011) includes the GovWILD data … > more


CoopIS Paper Accepted

The 19th International Conference on Cooperative Information Systems (CoopIS) Crete, Greece Instance-based … > more


ICSOC Paper Accepted

Revealing Hidden Relations among Web Services Using Business Process Knowledge ... Mohammed AbuJarour and Ahmed Awad  … > more


5 Papers Accepted at CIKM 2011/ 1 Paper Accepted at the co-located SMER Workshop

Proceedings of the 20th ACM Conference on Information and Knowledge Management, CIKM 2011, Glasgow, UK, October 24-28, 2011 … > more


JWS Article Accepted

Journal of Web Semantics: Science, Services and Agents on the World Wide Web, 9(3):339-345, 9/2011 Creating voiD … > more


Wikipedia extraction data published

With iPopulator, we have introduced a system that automatically populates infoboxes of Wikipedia articles by … > more


ICDKE Paper Accepted

International Conference on Data and Knowledge Engineering (ICDKE 2011), Milan A Generalization of Blocking and Windowing … > more


Black Swan - Discovering events that matter.

As part of a seminar supervised by Prof. Naumann and Johannes Lorey in the last winter term, a group of students … > more


SCC Paper Accepted

Discovering Linkage Patterns among Web Services using Business Process Knowledge ... Mohammed AbuJarour and Ahmed Awad  … > more


ICWS Paper Accepted

Automatic Sampling of Web Services Mohammed AbuJarour and Sebastian Oergel > more


Dr. Armin Roth

"Efficient Query Answering in Peer Data Management Systems" ... > more


TKDE Paper accepted

Scalable Iterative Graph Duplicate DetectionMelanie Herschel, Felix Naumann, Sascha Szott, and Maik TaubertTransaktions on … > more


Dr. Jens Bleiholder

"Data Fusion and Conflict Resolution in Integrated Information Systems" ... > more


Dr. Falk Brauer

"Extraktion und Identifikation von Entitäten in Textdaten im Umfeld der Enterprise Search" ...  … > more