Ralf Krestel

JCDL 20b

Visualising Large Document Collections by Jointly Modeling Text and Network Structure

Abstract

Many large text collections exhibit graph structures, either inherent to the content itself or encoded in the metadata of the individual documents. Examples of graphs extracted from document collections are co-author networks, citation networks, and named-entity co-occurrence networks. Furthermore, social networks can be extracted from email corpora, tweets, or social media. Visualisations of these large corpora typically use either the textual content or the network graph, but not both. In this paper, we propose to incorporate both text and graph, so that the visualisation shows not only the semantic information encoded in the documents' content but also the relationships expressed by the inherent network structure. To this end, we introduce a novel algorithm based on multi-objective optimisation to jointly position embedded documents and graph nodes in a two-dimensional landscape. We illustrate the effectiveness of our approach with real-world datasets and show that we can capture the semantics of large document collections better than other visualisations based on either the content or the network information alone.
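
To make the idea of jointly positioning documents by text and network structure more concrete, here is a minimal toy sketch. It is not the algorithm from the paper: it assumes a simple weighted sum of two objectives (a text-based stress term and a graph edge-attraction term) optimised by plain gradient descent. The function name `joint_layout` and the parameters `alpha`, `steps`, and `lr` are hypothetical and chosen only for illustration.

```python
import numpy as np


def joint_layout(doc_vecs, edges, alpha=0.5, steps=300, lr=0.05, seed=0):
    """Toy 2-D layout minimising a weighted sum of two objectives.

    Hypothetical illustration only, NOT the method from the paper.
    doc_vecs: (n, d) array of document embeddings (assumed to be given).
    edges:    list of (i, j) index pairs taken from the network (assumed).
    alpha:    weight of the text objective; 1 - alpha weights the graph one.
    """
    rng = np.random.default_rng(seed)
    n = doc_vecs.shape[0]

    # Text objective target: pairwise embedding distances, scaled to [0, 1].
    diff = doc_vecs[:, None, :] - doc_vecs[None, :, :]
    target = np.linalg.norm(diff, axis=-1)
    target = target / (target.max() + 1e-12)

    pos = rng.normal(scale=0.1, size=(n, 2))
    edge_arr = np.asarray(edges, dtype=int).reshape(-1, 2)

    for _ in range(steps):
        delta = pos[:, None, :] - pos[None, :, :]        # (n, n, 2)
        dist = np.linalg.norm(delta, axis=-1) + 1e-9     # (n, n)

        # Gradient of the stress sum (dist - target)^2, up to a constant factor.
        coeff = (dist - target) / dist
        np.fill_diagonal(coeff, 0.0)
        grad_text = 2.0 * (coeff[:, :, None] * delta).sum(axis=1) / n

        # Gradient of the edge term: squared 2-D distance summed over edges.
        grad_graph = np.zeros_like(pos)
        d_e = pos[edge_arr[:, 0]] - pos[edge_arr[:, 1]]
        np.add.at(grad_graph, edge_arr[:, 0], 2.0 * d_e)
        np.add.at(grad_graph, edge_arr[:, 1], -2.0 * d_e)

        pos -= lr * (alpha * grad_text + (1.0 - alpha) * grad_graph)

    return pos


# Four made-up documents: 0 and 1 are textually similar, as are 2 and 3.
docs = np.array([[0.0, 0.0], [0.1, 0.0], [1.0, 0.0], [1.1, 0.0]])
text_only = joint_layout(docs, [(0, 2)], alpha=1.0)  # ignores the edge
mixed = joint_layout(docs, [(0, 2)], alpha=0.2)      # edge pulls 0 and 2 together
```

In this toy setting, lowering `alpha` moves the linked documents 0 and 2 closer together even though their text differs. A weighted sum is only one simple way to combine several objectives, and the fixed step size is only suitable for small, sparse example graphs.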

Full Paper

JCDL20b.pdf

Conference Homepage

JCDL 2020

BibTeX Entry

@inproceedings{krestel-jcdl2020b,
  author    = {Repke, Tim and Krestel, Ralf},
  booktitle = {Proceedings of the Joint Conference on Digital Libraries (JCDL)},
  month     = {August 1--5},
  title     = {Visualising Large Document Collections by Jointly Modeling Text and Network Structure},
  year      = {2020}
}
