Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 
  • Data Repeatability in Web Science
    Our article Measuring and Facilitating Data Repeatability in Web Science (Link to pre-print) has been accepted for publication in the journal Datenbank-Spektrum. We publish the source code corresponding to this article here.
  • Toxic Comment Classification
    We participated in the Toxic Comment Classification Challenge (Link), a Kaggle challenge whose goal was to identify and classify toxic online comments. In collaboration with our colleagues from the DATEXIS group at Beuth Hochschule für Technik Berlin, we placed 54th out of 4551 teams, finishing in the top 2% of the leaderboard.
  • Aggression Identification
    We participated in the First Shared Task on Aggression Identification (Link), which is part of the First Workshop on Trolling, Aggression and Cyberbullying at the 27th International Conference on Computational Linguistics (COLING 2018). Our team placed 2nd out of 30 teams in the task of classifying social media posts as ‘Overtly Aggressive’, ‘Covertly Aggressive’, or ‘Non-aggressive’ on an unseen test dataset. We will submit a description of our approach to COLING 2018 and publish the augmented dataset here under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 licence (CC BY-NC-SA 4.0). We publish the source code of our submission here.
  • Semi-Automated Comment Moderation 
    You can find the code that accompanies our paper Delete or not Delete? Semi-Automatic Comment Moderation for the Newsroom here. The paper itself can be found here.