Ralf Krestel

You are here: Home > Publications > Conference Papers > ICDIM 18

About Me
Publications
- Book Chapters
- Journal Articles
- Conference Papers
  - ESWC 21
  - CHIIR 21
  - WI 20
  - CIKM 20
  - KI 20
  - ICWSM 20
  - JCDL 20a
  - JCDL 20b
  - TPDL 19
  - ICADL 18
  - ICDIM 18
  - JCDL 18a
  - JCDL 18b
  - NAACL 18
  - ECIR 18
  - ICDM 17
  - TPDL 17
  - NLDB 16
  - WI 15
  - KI 15
  - RECSYS 13
  - TPDL 13
  - HT 13
  - WI 11
  - KI 10
  - ECDL 10
  - WI 10
  - NLDB 10
  - LAWEB 09
  - RECSYS 09
  - WI 08
  - ASWC 08
  - LAWEB 08
  - LREC 08
  - RANLP 07
  - CanadianAI 07
- Workshop Papers
- Posters & Demos
- Proceedings
- Others
Travels

ICDIM 18

Learning Patent Speak: Investigating Domain-Specific Word Embeddings

Abstract

A patent examiner needs domain-specific knowledge to classify a patent application according to its field of invention. Standardized classification schemes help to compare a patent application to previously granted patents and thereby check its novelty. Due to the large volume of patents, automatic patent classification would be highly beneficial to patent offices and other stakeholders in the patent domain. However, a challenge for the automation of this costly manual task is the patent-specific language use. To facilitate this task, we present domain-specific pre-trained word embeddings for the patent domain. We trained our model on a very large dataset of more than 5 million patents to learn the language use in this domain. We evaluated the quality of the resulting embeddings in the context of patent classification. To this end, we propose a deep learning approach based on gated recurrent units for automatic patent classification built on the trained word embeddings. Experiments on a standardized evaluation dataset show that our approach increases average precision for patent classification by 17 percent compared to state-of-the-art approaches.

Full Paper

ICDIM18.pdf

Conference Homepage

ICDIM 2018

BibTex Entry

@inproceedings{krestel-icdim18, author = {Risch, Julian and Krestel, Ralf}, booktitle = {Proceedings of the Thirteenth International Conference on Digital Information Management (ICDIM)}, month = {September 24--26}, pages = {63--68}, title = {Learning Patent Speak: Investigating Domain-Specific Word Embeddings}, year = {2018} }

« prev| top| next »

News

Watch our new MOOC in German about hate and fake in the Internet ("Trolle, Hass und Fake-News: Wie können wir das Internet retten?") on openHPI (link).

New Publication

Our work on Measuring and Comparing Dimensionality Reduction Algorithms for Robust Visualisation of Dynamic Text Collections will be presented at CHIIR 2021.

New Photos

I added some photos from my trip to Hildesheim.