Prof. Dr. Felix Naumann

All News

Please find all news items since 2011 below.


Two Papers accepted at LWDA 2017

Zhe Zuo, Michael Loster, Felix Naumann, Ralf Krestel, Julian Risch  Our two papers "Uncovering Business Relationships: … > more


Demo accepted at CIKM 2017

Sebastian Kruse, David Hahn, Marius Walter, Felix Naumann Our demo proposal for Metacrate has been accepted for … > more


Paper accepted at TPDL 2017

Julian Risch, Ralf Krestel Our paper "What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers" … > more


Paper accepted at DSMM 2017 workshop

Tim Repke, Michael Loster, Ralf Krestel Our paper "Comparing Features for Ranking Relationships Between Financial Entities … > more


Vision paper accepted at ExploreDB 2017

Tobias Bleifuß, Theodore Johnson, Dmitri V. Kalashnikov, Felix Naumann, Vladislav Shkapenyuk and Divesh Srivastava Our … > more


Short paper accepted at ICDM 2017

Konstantina Lazaridou, Ralf Krestel, Felix Naumann Our paper "Identifying Media Bias by Analyzing Reported Speech" is … > more


Spark Summit 2017 session on Rheem

Rheem has been selected for presentation at the Spark Summit 2017. Located in the Bay Area, the Spark Summit is with over … > more


SIGMOD 2017 Tutorial about Data Profiling

Our tutorial "Data Profiling" will be held at the 2017 SIGMOD conference in Chicago. It is an evolved version of our 2016 … > more


Poster accepted at WWW 2017

The paper titled 'What was Hillary Clinton doing in Katy, Texas?' by Toni Gruetze, Konstantina Lazaridou, Ralf Krestel, and … > more


VLDB 2017 - Call for Papers in Industry track

Call for papers VLDB 2017 Industrial, Applications, and Experience Track August 28 to September 1, 2017 in Munich, … > more


Two papers on data profiling accepted at BTW 2017

Two papers on data profiling have been accepted at the BTW 2017. Both papers describe novel methods to discover different … > more


Metanome Projekt gewinnt Ideenwettbewerb des 10. Nationalen IT-Gipfels

Mit dem Projekt "Metanome - Die Data Profiling Plattform" gewannen Thorsten Papenbrock, Sebastian Kruse, Hazar Harmouch und … > more


Paper accepted at ICDM DINA workshop 2016

The paper "Cluster-based Sorted Neighborhood for Efficient Duplicate Detection" by Ahmad Samiei and Felix Naumann  had been … > more


FG-DB Herbsttagung am HPI (german)

Vom 12. September bis zum 14. September findet am HPI die LWDA Konferenz statt, eine Kombination von Tagungen der vier GI … > more


Tutorial on Graph Exploration at CIKM 2016

Davide Mottin, Anja Jentzsch, and Emmanuel Müller will present the tutorial "Graph Exploration: Taking the user into the … > more


Paper accepted at CIKM 2016

The paper "Approximate Discovery of Functional Dependencies for Large Datasets" has been selected for presentation at the … > more


Tutorial on Rheem at BOSS@VLDB 2016

After a public voting phase, Rheem has been chosen for a tutorial at the workshop for Big Data Open Source Systems … > more


Paper accepted at TPDL Doctoral Consortium 2016

The paper titled 'Identifying Political Bias in News Articles' by Konstantina Lazaridou and Dr. Ralf Krestel has been … > more


Article published in the Data Engineering Bulletin

The quarterly published IEEE Data Engineering Bulletin journal was just released. Its current issue contains articles that … > more


Rheem goes open source

The Rheem project has now published its source code under the Apache License on GitHub. Rheem is a cross-platform data … > more


Paper accepted at NLDB 2016

The paper titled 'Topic Shifts in StackOverflow: Ask it like Socrates' by Toni Gruetze, Ralf Krestel, and Felix Naumann has … > more


Article accepted for Journal of Web Semantics - SI: Knowledge Graphs

The article "CohEEL: Coherent and Efficient Named Entity Linking through Random Walks" by Toni Gruetze (HPI), Gjergji … > more


ACM SIGMOD Blog hosts Felix Naumann

Data optimization already touches many aspects of our lives promising a better, improved, even optimized, world. However, … > more


Two papers on data profiling accepted at SIGMOD 2016

The two papers "A Hybrid Approach to Functional Dependency Discovery" and "RDFind: Finding Conditional Inclusion … > more


Demo accepted at SIGMOD 2016

The Rheem project has been selected for a demo presentation at the SIGMOD 2016 (abstract at the bottom). The submission is … > more


Paper accepted at Q4APS WWW workshop

A full paper has been accepted at the Q4APS workshop at the WWW 2016 conference. The paper is called 'Which Answer is Best? … > more


Metanome version 1.0 released

In the last few months, we introduced many new features in the Metanome data profiling tool: We incorporated several new … > more


Student Paper accepted at EDBT 2016

The article "Holistic Data Profiling: Simultaneous Discovery of Various Metadata" by Jens Ehrlich, Mandy Roick, Lukas … > more


Order dependency detection article accepted for VLDB Journal

The article "Efficient Order Dependency Detection" by Philipp Langer (now IBM) and Felix Naumann (HPI) was accepted for … > more


Felix Naumann wins teaching prize

After being nominated by the student body of HPI, the faculty for mathematics and natural sciences of the University of … > more


Paper accepted at Web Intelligence 2015

The results of the Master thesis by Tobias Schubotz are being presented at the Web Intelligence conference in Singapore in … > more


Paper accepted at LWA 2015

The paper titled 'How to Stay Up-to-date on Twitter with General Keywords' by Mandy Roick, Maximilian Jenders, and Ralf … > more


Demo and Paper accepted at ISWC 2015 and ISWC 2015 Workshop

Demo at ISWC 2015  Exploring Linked Data Graph Structures Anja Jentzsch, Christian Dullweber, Pierpaolo Troiano, Felix … > more


Paper accepted at KI 2015

Full Paper accepted at KI 2015: A Serendipity Model For News Recommendation Maximilian Jenders, Thorben Lindhauer, … > more


Markus Freitag wins TDWI Award for master's thesis

The former master's student Markus Freitag has won the prestigious TDWI award for the best master's thesis in the area of … > more


Demo accepted for VLDB 2015

The demonstration paper "Data Profiling with Metanome" was accepted for the 2015 VLDB conference. The authors are Thorsten … > more


Survey on Data Profiling published in VLDB Journal

The article "Profiling relational data: a survey" by Ziawasch Abedjan (MIT), Lukasz Golab (University of Waterloo) and … > more


German news article about R

Dr. Ralf Krestel about the programming language R and its use for predictive analytics (in German):  … > more


Metanome presented at Sapphire 2015

The data profiling framework Metanome is presented at SAP's Sapphire 2015 conference in Orlando. See here for details and … > more


Second paper accepted at VLDB 2015

Experiments and Analysis Paper Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms Thorsten … > more


Dr. Arvid Heise

Arvid Heise has successfully defended his Ph.D. dissertation on March 17, 2015! His work focused on the topic "Data … > more


Poster accepted at WWW 2015

Research Poster Paper Tweet-Recommender: Finding Relevant Tweets for News Articles Ralf Krestel and Thomas Werkmeister … > more


Paper accepted at TempWeb 2015

Research Paper Learning Temporal Tagging Behaviour Toni Gruetze, Gary Yao, and Ralf Krestel Abstract. Social networking … > more


Ziawasch Abedjan wins dissertation award

Dr. Ziawasch Abedjan graduated from HPI in June 2014. His dissertation with the title "Improving RDF Data with Data Mining" … > more


Paper accepted at VLDB 2015

Research Paper Divide&Conquer-based Inclusion Dependency Discovery Thorsten Papenbrock, Sebastian Kruse, Jorge-Arnulfo … > more


Apache Flink is a top-level project

After eight months in the incubating phase, the Apache Software Foundation board unanimously passed the resolution to … > more


Paper accepted at BTW 2015

Research Paper Scaling out the Discovery of Inclusion Dependencies Sebastian Kruse, Thorsten Papenbrock, Felix Naumann  … > more


Paper accepted at EDBT 2015

Research Paper Estimating Data Integration and Cleaning Effort Sebastian Kruse, Paolo Papotti, Felix Naumann Abstract.  … > more


Dr. Alexander Albrecht

Alexander Albrecht has successfully defended his Ph.D. dissertation on November 26, 2014! His work focused on the topic … > more


CIKM 2014 Best Student Paper Award

Our submission "DFD: Efficient Functional Dependency Discovery" by Ziawasch Abedjan, Patrick Schulze, and Felix Naumann to … > more


Paper accepted at DINA

1st International Workshop on Data Integration and Applications co-located with the IEEE International Conference on Data … > more


Dr. Johannes Lorey defended his Ph.D. dissertation

Johannes Lorey has successfully defended his Ph.D. dissertation on October 27, 2014! His work focused on the … > more


Anja Jentzsch wins Semantic Web Journal 2014 Outstanding Paper Award

Together with her co-authors Jens Lehmann, Robert Isele, Max Jakob, Dimitris Kontokostas, Pablo N. Mendes, Sebastian … > more


Internet-Wachstum: Datenweb seit 2011 mehr als verdreifacht

Das „Web der Daten“ hat sich seit Herbst 2011 mehr als verdreifacht. Das ist das Ergebnis einer Analyse, die … > more


Journal article accepted at TKDE

Progressive Duplicate Detection Thorsten Papenbrock and Arvid Heise and Felix Naumann Abstract. Duplicate detection is … > more


2 full papers accepted at CIKM

Estimating the Number and Sizes of Fuzzy-Duplicate Clusters Arvid Heise, Gjergji Kasneci, and Felix Naumann Abstract. … > more


Ziawasch Abedjan defended his Ph.D. dissertation

Ziawasch Abedjan has successfully defended his Ph.D. dissertation with distinction on July 18, 2014! His work focused on … > more


Paper accepted at COLING

25th International Conference on Computational Linguistics (COLING) August 23, 2014, Dublin, Ireland  … > more


Know@LOD Paper selected for "Best of Workshop" Session

Our paper "Ziawasch Abedjan and Felix Naumann. Amending RDF Entities with New Facts" from KNOW@LOD 2014 workshop has been … > more


Stratosphere accepted as Apache Incubator Project

We are happy to announce that Stratosphere has been accepted as a project for the Apache Incubator. The proposal has been … > more


2 Papers accepted at ESWC Workshops.

Know@LOD 2014 and PROFILES 2014, co-located with 10th Extended Semantic Web Conference (ESWC) 2014  … > more


Stratosphere overview paper accepted for VLDB Journal

The Stratosphere Platform for Big Data Analytics Alexander Alexandrov, Rico Bergmann, Stephan Ewen, Johann-Christoph … > more


SIGMOD Demo accepted

Versatile optimization of UDF-heavy data flows with Sofa Astrid Rheinländer, Martin Beckmann, Anja Kunkel, Arvid Heise, … > more


Paper accepted at DINA

Research Paper Bootstrapping Wikipedia to Answer Ambiguous Person Name Queries Toni Gruetze, Gjergji Kasneci, Zhe Zuo, … > more


Paper accepted at DESWeb

5th International Workshop on Data Engineering meets the Semantic Web (DESWeb) In conjunction with ICDE 2014, Chicago … > more


DFG research unit "Stratosphere" extended

Joint research on Stratosphere by TU Berlin, HU Berlin, and HPI > more


Article accepted for Informatik-Spektrum

Ein Datenbankkurs mit 6.000 Teilnehmern: Erfahrungen auf der openHPI MOOC Plattform > more


Research Paper and Demo accepted for ICDE 2014

30th IEEE International Conference on Data Engineering (ICDE), Chicago, IL, USA, March 31st - April 4th, 2014  … > more


Paper accepted at VLDB 2014

40th International Conference on Very Large Data Bases (VLDB), Hangzhou, China, 1st - 5th September 2014 Scalable … > more


Paper accepted at iiWAS 2013

15th International Conference on Information Integration and Web-based Applications & Services  … > more


Dr. Christoph Böhm

Christoph Böhm has successfully defended his Ph.D. dissertation on September 13, 2013. > more


2 Papers at ICIQ - International Conference on Information Quality

Systematic ETL Management – Experiences with high-level Operators by Alexander Albrecht and Felix Naumann and On … > more


Database Genealogy - V4 released

We have just released the latest version of our RDBMS Genealogy showing a timeline of many popular relational database … > more


Dr. Jana Bauckmann

Jana Bauckmann has successfully defended her Ph.D. dissertation on June 14, 2013. > more


Dr. Dustin Lange

Dustin Lange successfully defends his PhD thesis "Effective and Efficient Similarity Search in Databases".  … > more


Datenbank-Spektrum Article Accepted

Special Issue on RDF Data Management (German Database Forum) > more


Paper accepted at SSDBM 2013

25th International Conference on Scientific and Statistical Database Management (SSDBM), July 29-31, 2013, Baltimore, … > more


Data Profiling Revisited: Article accepted for SIGMOD Record

Felix Naumann. Data Profiling Revisited. SIGMOD Record (to appear), 2013. Data profiling comprises a broad range of … > more


Paper accepted at MSND workshop @ WWW 2013

Analyzing and Predicting Viral Tweets Maximilian Jenders, Gjergji Kasneci, and Felix Naumann Abstract. Twitter and other … > more


Runner Up for Best Paper Award at BTW 2013

The submission "Duplicate Detection on GPUs" by Benedikt Forchhammer, Thorsten Papenbrock, Thomas Stening, Sven … > more


Contributions to ESWC 2013

10th Extended Semantic Web Conference in Montpellier, France > more


Article accepted at Information Systems Journal (IS)

Cost-Aware Query Planning for Similarity Search Dustin Lange and Felix Naumann Abstract. Similarity search aims to find … > more


Paper and demo accepted at BTW Conference

15th BTW conference on "Database Systems for Business, Technology, and Web" (BTW 2013) Magdeburg, Germany Duplicate … > more


Felix Naumann gives keynote talk at ICIQ 2012

On November 17 Felix Naumann talked about "The Quality of Web Data" at the 2012 International Conference on Information … > more


Article accepted at Information Systems Journal (IS)

Cross-lingual Entity Matching and Infobox Alignment in Wikipedia Daniel Rinser, Dustin Lange, and Felix Naumann  … > more


bibDuDe deduplicates BibTeX files

A tool to deduplicate scientific references > more


Article accepted at Int. Journal of Data Warehousing and Mining (IJDWM)

Fusion Cubes: Towards Self-Service Business Intelligence Alberto Abelló, Jérôme Darmont, Lorena Etcheverry, Matteo … > more


Felix Naumann gives keynote talk at ICWE 2012

On July 26 Felix Naumann talked about "Extreme Web Data Integration" at the 2012 International Conference on Web … > more


3 Papers (short) accepted at CIKM 2012

21st ACM International Conference on Information and Knowledge Management (CIKM) will be held from October 29 to November … > more


3 Papers accepted at VLDB Workshops

DBRank 2012 – 6th International Workshop on Ranking in Databases, in conjunction with VLDB 2012 Scalable Similarity … > more


Paper accepted at I-Semantics Conference

I-SEMANTICS 2012 – 8th Int. Conference on Semantic Systems, Graz, Austria Scalable Peer-to-Peer-based RDF Management  … > more


Paper accepted at ER Conference

31st International Conference on Conceptual Modeling (ER 2012) - Florence, Italy Schema Decryption for Large … > more


Paper accepted at SSDBM

Proceedings of the 24th International Conference on Scientific and Statistical Database Management, 25-27 June 2012 Chania, … > more


Contributions to WWW 2012

Demo and LDOW paper accepted > more


JWS Article Accepted

Integrating Open Government Data with Stratosphere for more Transparency Arvid Heise and Felix Naumann Abstract. … > more


LREC Paper Accepted

The eighth international conference on Language Resources and Evaluation (LREC), Istanbul, Turkey. "Fine-grained … > more


Daniel Rinser wins award for his masters thesis

IQ Best Master Degree Wettbewerb der Deutschen Gesellschaft für Informations- und Datenqualität e. V. (DGIQ)  … > more


HPI TV releases video about GovWILD

See the new video about our Government Data Integration platform GovWILD. > more


Dr. Mohammed AbuJarour

"Enriched Service Descriptions: Sources, Approaches, and Usages" > more


Tool voidGen released

As part of our winning submission at the 2010 Billion Triple Challenge at the International Semantic Web Conference, we … > more


ICDE Paper Accepted

28th IEEE International Conference on Data Engineering (ICDE) Washington, DC, USA Adaptive Windows for Duplicate … > more


GovWILD in LOD cloud

The GovWILD team is happy to announce that the latest version of the LOD cloud (September 2011) includes the GovWILD data … > more


CoopIS Paper Accepted

The 19th International Conference on Cooperative Information Systems (CoopIS) Crete, Greece Instance-based "one-to-some" … > more


ICSOC Paper Accepted

Revealing Hidden Relations among Web Services Using Business Process Knowledge ... Mohammed AbuJarour and Ahmed Awad  … > more


5 Papers Accepted at CIKM 2011/ 1 Paper Accepted at the co-located SMER Workshop

Proceedings of the 20th ACM Conference on Information and Knowledge Management, CIKM 2011, Glasgow, UK, October 24-28, 2011 … > more


JWS Article Accepted

Journal of Web Semantics: Science, Services and Agents on the World Wide Web, 9(3):339-345, 9/2011 Creating voiD … > more


Wikipedia extraction data published

With iPopulator, we have introduced a system that automatically populates infoboxes of Wikipedia articles by extracting … > more


ICDKE Paper Accepted

International Conference on Data and Knowledge Engineering (ICDKE 2011), Milan A Generalization of Blocking and Windowing … > more


Black Swan - Discovering events that matter.

As part of a seminar supervised by Prof. Naumann and Johannes Lorey in the last winter term, a group of students examined … > more


SCC Paper Accepted

Discovering Linkage Patterns among Web Services using Business Process Knowledge ... Mohammed AbuJarour and Ahmed Awad  … > more


ICWS Paper Accepted

Automatic Sampling of Web Services Mohammed AbuJarour and Sebastian Oergel > more


Dr. Armin Roth

"Efficient Query Answering in Peer Data Management Systems" ... > more


TKDE Paper accepted

Scalable Iterative Graph Duplicate DetectionMelanie Herschel, Felix Naumann, Sascha Szott, and Maik TaubertTransaktions on … > more


Dr. Jens Bleiholder

"Data Fusion and Conflict Resolution in Integrated Information Systems" ... > more


Dr. Falk Brauer

"Extraktion und Identifikation von Entitäten in Textdaten im Umfeld der Enterprise Search" ...  … > more