Hasso-Plattner-Institut
Prof. Dr. Felix Naumann
  
 

Text Mining in Practice

Advisors: Dr. Ralf Krestel, Julian Risch

The project seminar takes place on Wednesdays from 13:30 to 15:00 in G-E 15/16 on Campus III.

There will be weekly individual group meetings (held in German or English) and occasional meetings with all groups during the seminar slot (held in English) to exchange ideas and present results.

6 ECTS, 4 SWS, ITSE/OSIS/ISAE

Course Description

Text Mining is the process of extracting and deriving information from unstructured text data. It draws on methods from Natural Language Processing and Machine Learning, such as Probabilistic Graphical Models and Deep Learning.

This project seminar is about the practical application of Text Mining. It focuses on a real-world dataset of millions of English-language comments that users have posted in the discussion sections of online newspapers. Each group of students will work on one of several topics, such as:

  • Hate Speech Detection
  • Comment Volume Prediction
  • Comment Ranking
  • Modeling Users' Commenting Behavior
  • Discussion Summarization
  • Modeling Linguistic Change

Grading

The grade will consist of:

  • 20% Mid-Term Presentation
  • 20% Final Presentation
  • 30% Implementation
  • 30% Paper (4 pages, ACM Template)

References

  • Machine Learning - The Art and Science of Algorithms that Make Sense of Data by Peter Flach
  • ...

Schedule

Date      Topic                            Presenter
11.04.18  Introduction I                   Ralf Krestel, Julian Risch
18.04.18  optional Deep Learning Tutorial  Julian Risch
25.04.18  Introduction II                  Julian Risch
02.05.18
09.05.18  Discussion                       Students
16.05.18
23.05.18
30.05.18  Mid-Term Presentations           Students
06.06.18
13.06.18  Discussion                       Students, Industry Partner
20.06.18
27.06.18  Writing Research Papers          Ralf Krestel
04.07.18
11.07.18
18.07.18  Final Presentations              Students
01.08.18  Paper Submission Deadline