Ralf Krestel

JCDL 18b

WELDA: Enhancing Topic Models by Incorporating Local Word Context

Abstract

The distributional hypothesis states that similar words tend to occur in similar contexts. Word embedding models exploit this hypothesis by learning word vectors based on the local context of words. Probabilistic topic models, on the other hand, utilize word co-occurrences across documents to identify topically related words. Due to their complementary nature, these models define different notions of word similarity, which, when combined, can produce better topical representations. In this paper, we propose WELDA, a new type of topic model, which combines word embeddings (WE) with latent Dirichlet allocation (LDA) to improve topic quality. We achieve this by estimating topic distributions in the word embedding space and exchanging selected topic words via Gibbs sampling from this space. We present an extensive evaluation showing that WELDA cuts runtime by at least 30% while outperforming other combined approaches with respect to topic coherence and on word intrusion tasks.
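
The word exchange step mentioned in the abstract could look roughly like the following sketch. It is an illustration under stated assumptions, not the paper's implementation: the choice of a multivariate Gaussian per topic, the nearest-neighbour lookup, the toy embeddings, and all function and parameter names (`fit_topic_gaussian`, `sample_replacement_word`, `maybe_exchange`, `exchange_prob`, `ridge`) are hypothetical and introduced here only for illustration.

```python
import numpy as np


def fit_topic_gaussian(embeddings, top_word_ids, ridge=1e-6):
    """Estimate a Gaussian over the embedding vectors of a topic's top words.

    Assumption: a topic is summarised in embedding space by a mean vector
    and a covariance matrix. The small ridge term keeps the covariance
    well conditioned when only a few words are available.
    """
    vectors = embeddings[top_word_ids]
    mean = vectors.mean(axis=0)
    # rowvar=False: every row is one observation (one word vector).
    cov = np.cov(vectors, rowvar=False) + ridge * np.eye(vectors.shape[1])
    return mean, cov


def sample_replacement_word(rng, mean, cov, embeddings):
    """Draw a point from the topic Gaussian and return the nearest word id."""
    point = rng.multivariate_normal(mean, cov)
    distances = np.linalg.norm(embeddings - point, axis=1)
    return int(np.argmin(distances))


def maybe_exchange(rng, word_id, topic_gaussian, embeddings, exchange_prob):
    """With probability exchange_prob, swap the word for one sampled from
    the topic's region of the embedding space; otherwise keep it."""
    if rng.random() < exchange_prob:
        mean, cov = topic_gaussian
        return sample_replacement_word(rng, mean, cov, embeddings)
    return word_id


# Toy example: six "words" in a 2-d embedding space forming two clusters.
toy_embeddings = np.array([
    [0.0, 0.0], [0.1, 0.0], [0.0, 0.1],        # cluster A (word ids 0-2)
    [10.0, 10.0], [10.1, 10.0], [10.0, 10.1],  # cluster B (word ids 3-5)
])
rng = np.random.default_rng(0)
topic_a = fit_topic_gaussian(toy_embeddings, [0, 1, 2])

# Word 4 sits in cluster B; if exchanged, it is replaced by a cluster A word.
print(maybe_exchange(rng, 4, topic_a, toy_embeddings, exchange_prob=0.5))
```

In a full sampler, a step like `maybe_exchange` would sit inside the Gibbs loop, so that the topic-word counts are updated with the possibly exchanged word; how the exchange probability and the number of top words are chosen is left open in this sketch.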

Full Paper

JCDL18b.pdf

Conference Homepage

JCDL 2018

BibTeX Entry

@InProceedings{krestel-jcdl18b,
  author    = {Stefan Bunk and Ralf Krestel},
  title     = {WELDA: Enhancing Topic Models by Incorporating Local Word Context},
  booktitle = {Joint Conference on Digital Libraries (JCDL 2018)},
  pages     = {293--302},
  location  = {Fort Worth, Texas, USA},
  month     = {June 3--6},
  year      = {2018},
  publisher = {ACM}
}
