Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
 

05.06.2017

What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers

Julian Risch, Ralf Krestel

Our paper "What Should I Cite? Cross-Collection Reference Recommendation of Patents and Papers" has been accepted for the International Conference on Theory and Practice of Digital Libraries (TPDL 2017). The conference takes place in September 2017 in Thessaloniki, Greece.

Abstract

Research results manifest in large corpora of patents and scientific papers. However, both corpora lack a consistent taxonomy, and references across different document types are sparse. For these reasons, and because the two document types use contrasting, domain-specific language, recommending similar papers for a given patent (or vice versa) is challenging.
We propose a hybrid recommender system that leverages topic distributions and key terms to recommend related work despite these challenges. As a case study, we evaluate our approach on patents and papers from two fields: medicine and computer science. We find that topic-based recommenders complement term-based recommenders for documents with collection-specific language and increase mean average precision by up to 23%. As a result of our work, publications from both corpora form a joint digital library, which connects academia and industry.
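
To make the idea of combining term-based and topic-based evidence more concrete, the following is a minimal sketch, not the implementation from the paper. It assumes each document is already represented by a sparse key-term weight vector and a topic distribution, and it blends the two cosine similarities with a hypothetical weight `alpha`. The abstract does not state how the two signals are actually combined, so the linear blend, all function names, and the toy documents are illustrative assumptions. The sketch also includes the standard definition of mean average precision, the metric named in the abstract.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse vectors given as dicts."""
    dot = sum(w * b.get(k, 0.0) for k, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def hybrid_score(query, candidate, alpha=0.5):
    """Linear blend of term and topic similarity (assumed, not from the paper)."""
    term_sim = cosine(query["terms"], candidate["terms"])
    topic_sim = cosine(query["topics"], candidate["topics"])
    return alpha * term_sim + (1.0 - alpha) * topic_sim


def recommend(query, candidates, alpha=0.5):
    """Return candidate ids ordered by descending hybrid score."""
    ranked = sorted(candidates,
                    key=lambda d: hybrid_score(query, d, alpha),
                    reverse=True)
    return [d["id"] for d in ranked]


def average_precision(ranked_ids, relevant_ids):
    """Average of precision@k over the ranks k at which a relevant id appears."""
    hits, total = 0, 0.0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            hits += 1
            total += hits / rank
    return total / len(relevant_ids) if relevant_ids else 0.0


def mean_average_precision(runs):
    """Mean of average precision over (ranked_ids, relevant_ids) pairs."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


# Toy data (made up for illustration): a patent and two candidate papers.
patent = {"id": "P", "terms": {"apparatus": 1.0, "neural": 1.0},
          "topics": {0: 0.9, 1: 0.1}}
paper_a = {"id": "A", "terms": {"neural": 1.0, "network": 1.0},
           "topics": {0: 0.8, 1: 0.2}}
paper_b = {"id": "B", "terms": {"apparatus": 1.0, "valve": 1.0},
           "topics": {0: 0.1, 1: 0.9}}

ranking = recommend(patent, [paper_b, paper_a], alpha=0.5)
```

In this toy example both papers share exactly one key term with the patent, so the term-based similarity alone cannot separate them. The topic distributions break the tie in favor of paper A, which illustrates in miniature how a topic-based signal can complement a term-based one when vocabulary differs between collections.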