Hasso-Plattner-Institut25 Jahre HPI
Hasso-Plattner-Institut25 Jahre HPI

Social Media Analyses (Wintersemester 2013/2014)

Dozent: Prof. Dr. Christoph Meinel (Internet-Technologien und -Systeme) , Patrick Hennig (Internet-Technologien und -Systeme) , M.Sc. Philipp Berger (Internet-Technologien und -Systeme)

Allgemeine Information

  • Semesterwochenstunden: 4
  • ECTS: 6
  • Benotet: Ja
  • Einschreibefrist: 1.10.2013 - 31.10.2013
  • Lehrform: SP
  • Belegungsart: Wahlpflichtmodul
  • Maximale Teilnehmerzahl: 10

Studiengänge, Modulgruppen & Module

IT-Systems Engineering BA
IT-Systems Engineering MA
  • IT-Systems Engineering A
  • IT-Systems Engineering B
  • IT-Systems Engineering C
  • IT-Systems Engineering D


With a wide circulation of 180 million weblogs worldwide, weblogs with good reason are one of the killer applications of the worldwide web. It was already shown on several occasions that it can be highly meaningful for individuals, institutions or even governments to find ways and measures to extract information out of the blogosphere.

However, it is increasingly difficult - if not impossible - for the average internet user and weblog enthusiast to grasp the blogosphere’s complexity as a whole, due to thousands of new weblogs and an almost uncountable number of new posts adding up to the before-mentioned collective on a daily basis.

Therefore, mining, analyzing, modeling and presenting this immense data collection is of central interest. This could enable the user to detect technical trends, political atmospheric pictures or news articles about a specific topic.

More information about BlogIntelligence

In this seminar, we focus on understanding social media streams  like the blogosphere. To reveal potential relationships or visualize the dynamics of social media, various data mining technologies will be used within the selected topics in this seminar.


Good knowledge in

  • Operating Systems and Software Engineering
  • Internet Basics
  • Basics in Data Mining


  • Philipp Berger, Patrick Hennig, Justus Bross, Christoph Meinel Mapping the Blogosphere — Towards a Universal and Scalable Blog-Crawler,   in: Third IEEE International Conference on Social Computing (SocialCom 2011), IEEE CS, MIT, Boston, USA , 10, 2011. ISBN: 978-0-7695-4578-3.
  • Justus Bross, Patrick Schilf, Maximilian Jenders, Christoph Meinel Visualizing the Blogosphere with BlogConnect,   in: Third IEEE International Conference on Social Computing (SocialCom 2011), IEEE CS, MIT, Boston, USA , 10, 2011. ISBN: 978-0-7695-4578-3.
  • Justus Bross, Keven Richly, Matthias Kohnen, Christoph Meinel Identifying the top-dogs of the blogosphere,   in:SOCIAL NETWORK ANALYSIS AND MINING Volume 2, Number 1, 53-67, DOI: 10.1007/s13278-011-0027-7
  • J. Bross, K. Richly, P. Schilf, C. Meinel, Social Physics of the Blogosphere: Capturing, Analyzing and Presenting Interdependencies of Partial Blogospheresin: “From Sociology to Computing in Social Networks“ in: “Theory, Foundations and Applications Series: Lecture Notes in Social Networks, Vol. 1″, Memon, Nasrullah; Alhajj, Reda (Eds.), ISBN: 978-3-7091-0293-0, Springer: NewYork/Wien, 2010
  • J. Bross, P. Hennig, P. Berger, C. Meinel, Feed-Crawler Enhancement for Blogosphere-Mapping, International Journal of Advanced Computer Science and Applications IJACSA, Vol. 1, No.2,  US.ISSN: 2156-5570 (Online), August 2010
  • J.  Bross, M. Quasthoff, P. Berger, P. Hennig, C. Meinel, Mapping the blogosphere with RSS-feeds, 24th IEEE International Conference on Advanced Information Networking and Applications (AINA-2010) – Perth, Australia, 20-23 April 2010
  • J. Bross, P. Schilf, C. Meinel, Visualizing blog archives to explore content- and context-related interdependencies, 2010 IEEE/WIC/ACM International Conference on Web Intelligence (WI’10), Toronto, Kanada, 2010

Lern- und Lehrformen

This seminar has a limited number of participants! Up to 10 students can attend! The course enrollment depends on the assigned topics.

All students will work on a specific topic.
Usually two students per team, but it depends on the selected topic.

To participate in the seminar you have to attend the kickoff meeting on 15.10.2013.


The presentation of the kick-off meeting can be found here.

To enroll please follow exactly the steps from the slides!!!

The topics are:

  • Hierarchical Cluster Labeling
  • Machine Learning for Blog Type Detection
  • Printerest - Content Curation
  • Inteview Crawling / Mining
  • Location Tagging for Blog Posts
  • Ontology-based Query Enrichment
  • Event Detection
  • HeatMap - Visualization
  • the final set of topics will be announced during the seminar


The final evaluation will be based on:

  • Initial implementation / idea presentation, 15%
  • Final presentation, 25%
  • Report, 12-18p LNCS, 30%
  • Implementation, 15%
  • Integration, 15%
  • Participation in the seminar, paper review (bonus points)


Tuesdays, 9.15-10.45

Room A-2.2