Hasso-Plattner-Institut

Social Media Mining (Winter Semester 2014/2015)

Lecturer: Prof. Dr. Christoph Meinel (Internet Technologies and Systems)

General Information

  • Weekly hours per semester: 4
  • ECTS: 6
  • Graded: Yes
  • Enrollment deadline: 24.10.2014
  • Course format: SP
  • Enrollment type: Compulsory elective module
  • Maximum number of participants: 10

Degree Programs, Module Groups & Modules

IT-Systems Engineering BA
IT-Systems Engineering MA
  • IT-Systems Engineering A
  • IT-Systems Engineering B
  • IT-Systems Engineering C
  • IT-Systems Engineering D
  • IT-Systems Engineering Analysis


With 180 million weblogs worldwide, weblogs are with good reason considered one of the killer applications of the World Wide Web. It has been shown on several occasions that extracting information from the blogosphere can be highly valuable for individuals, institutions, and even governments.

However, it is increasingly difficult, if not impossible, for the average internet user and weblog enthusiast to grasp the blogosphere's complexity as a whole, because thousands of new weblogs and an almost uncountable number of new posts are added to it every day.

Therefore, mining, analyzing, modeling, and presenting this immense data collection is of central interest. Doing so could enable users to detect technical trends, gauge political sentiment, or find news articles about a specific topic.

More information about BlogIntelligence

In this seminar, we focus on understanding social media streams such as the blogosphere. Within the selected seminar topics, various data mining techniques will be used to reveal potential relationships or to visualize the dynamics of social media.

Link to Topic Presentation


Good knowledge of:

  • Operating Systems and Software Engineering
  • Internet Basics
  • Basics in Data Mining


Check out our papers:

  • 2010: Bross, Justus and Quasthoff, Matthias and Berger, Philipp and Hennig, Patrick and Meinel, Christoph
    Mapping the blogosphere with rss-feeds
  • 2010: Bross, Justus and Berger, Philipp and Hennig, Patrick and Meinel, Christoph
    RSS-Crawler enhancement for blogosphere-mapping
  • 2011: Berger, Philipp and Hennig, Patrick and Bross, Justus and Meinel, Christoph
    Mapping the Blogosphere - Towards a universal and scalable Blog-Crawler
  • 2013: Hennig, Patrick and Berger, Philipp and Meinel, Christoph
    Identify emergent trends based on the blogosphere
  • Hennig, Patrick and Berger, Philipp and Godde, Christian and Hoffmann, Daniel and Meinel, Christoph
    A Fuzzy, Incremental, Hierarchical Approach of Clustering Huge Collections of Web Documents
  • 2013: Berger, Philipp and Hennig, Patrick and Klingbeil, Thomas and Kohnen, Matthias and Pade, Steffen and Meinel, Christoph
    Mining the Boundaries of Social Networks: Crawling Facebook and Twitter for BlogIntelligence
  • 2013: Hennig, Patrick and Berger, Philipp and Meinel, Christoph and Graber, Maria and Hildebrandt, Jens and Lehmann, Stefan and Ramson, Cathleen
    Tracking Visitor Engagement in the Blogosphere for Leveraging Rankings
  • 2013: Hennig, Patrick and Berger, Philipp and Meinel, Christoph
    Web Mining Accelerated with In-Memory and Column Store Technology
  • 2013: Berger, Philipp and Hennig, Patrick and Meinel, Christoph
    Identifying Domain Experts in the Blogosphere - Ranking Blogs Based on Topic Consistency
  • 2014: Berger, Philipp and Hennig, Patrick and Detje, Stephan
    BlogSphere - A Topical Map of the Blogosphere


The final evaluation will be based on:

  • Initial implementation / idea presentation, 15%
  • Final presentation, 25%
  • Report, 12-18 pages in LNCS format, 30%
  • Implementation, 15%
  • Integration, 15%
  • Participation in the seminar, paper review (bonus points)


Tuesdays, 9:15-10:45, Room A-2.2