Ralf Krestel


TRAC 18a

Aggression Identification Using Deep Learning and Data Augmentation

Abstract

Social media platforms allow users to share and discuss their opinions online. However, a minority of user posts are aggressive; such posts hinder respectful discussion and, at an extreme level, are liable to prosecution. The automatic identification of such harmful posts is important because it can support the costly manual moderation of online discussions. Further, automation allows unprecedented analyses of discussion datasets that contain millions of posts. This system description paper presents our submission to the First Shared Task on Aggression Identification. We propose to augment the provided dataset to increase the number of labeled comments from 15,000 to 60,000, thereby introducing linguistic variety into the dataset. As a consequence of the larger amount of training data, we are able to train a deep neural net that generalizes especially well to unseen data. To further boost performance, we combine this neural net with three logistic regression classifiers trained on character and word n-grams, and hand-picked syntactic features. This ensemble is more robust than the individual models. Our team, named “Julian”, achieves an F1-score of 60% on both English datasets, 63% on the Hindi Facebook dataset, and 38% on the Hindi Twitter dataset.
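The abstract mentions character and word n-gram features and an ensemble of several classifiers. As a rough illustration only, the sketch below shows how such n-grams can be counted and how the class probabilities of several models could be averaged (soft voting). The function names, the label names, and the averaging rule are assumptions made for this example; the abstract does not say how the paper combines its models, so this should not be read as the authors' implementation.

```python
from collections import Counter


def char_ngrams(text, n):
    """Count overlapping character n-grams in a lowercased string."""
    text = text.lower()
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def word_ngrams(text, n):
    """Count overlapping word n-grams after whitespace tokenization."""
    tokens = text.lower().split()
    return Counter(" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def average_probabilities(model_outputs):
    """Soft voting (an assumed combination rule): average per-class probabilities."""
    labels = model_outputs[0].keys()
    return {
        label: sum(out[label] for out in model_outputs) / len(model_outputs)
        for label in labels
    }


def predict(model_outputs):
    """Return the label with the highest averaged probability."""
    averaged = average_probabilities(model_outputs)
    return max(averaged, key=averaged.get)


if __name__ == "__main__":
    comment = "you are so wrong"
    print(char_ngrams(comment, 3).most_common(3))
    print(word_ngrams(comment, 2))

    # Hypothetical probabilities from three separately trained models.
    outputs = [
        {"aggressive": 0.9, "non_aggressive": 0.1},
        {"aggressive": 0.4, "non_aggressive": 0.6},
        {"aggressive": 0.3, "non_aggressive": 0.7},
    ]
    print(predict(outputs))
```

Averaging probabilities rather than taking a majority vote lets one confident model outweigh two uncertain ones, which is one common way an ensemble can behave more robustly than its individual members.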

Full Paper

TRAC18a.pdf

Workshop Homepage

TRAC 2018

BibTeX Entry

@InProceedings{krestel-tac18a,
  author    = {Risch, Julian and Krestel, Ralf},
  title     = {{Aggression Identification Using Deep Learning and Data Augmentation}},
  booktitle = {Proceedings of the 1st Workshop on Trolling, Aggression and Cyberbullying},
  series    = {TRAC'18},
  year      = {2018},
  location  = {Santa Fe, NM, USA},
  pages     = {--},
  month     = {August 25th},
  publisher = {ACL}
}

