Hasso-Plattner-Institut25 Jahre HPI
Hasso-Plattner-Institut25 Jahre HPI
Login
 

Praktische Anwendung von Video Analyse Technologien (Wintersemester 2014/2015)

Dozent: Prof. Dr. Christoph Meinel (Internet-Technologien und -Systeme) , Dr. Haojin Yang (Internet-Technologien und -Systeme)

Allgemeine Information

  • Semesterwochenstunden: 4
  • ECTS: 6
  • Benotet: Ja
  • Einschreibefrist: 24.10.2014
  • Lehrform: Seminar
  • Belegungsart: Wahlpflichtmodul
  • Maximale Teilnehmerzahl: 20

Studiengänge, Modulgruppen & Module

IT-Systems Engineering BA

Beschreibung

In the last decade digital libraries and web video portals have become more and more popular. The amount of video data available on the World Wide Web (WWW) is growing rapidly. According to the official statistic-report of the popular video portal YouTube  more than 6 billion hours of video are watched each month and about 100 hours of video are uploaded every minute. Therefore, how to efficiently retrieve video data on the web or within large video archives has become a very important and challenging task.

In this seminar, various methods for automatic video analysis and retrieval will be developed based on state-of-the-art computer vision technologies. The system accuracy and performance will be evaluated by using opened benchmark.

More information

In our current research, we focus on state-of-the-art techniques on video analysis and multimedia information retrieval (MIR). Potential topics include video Shot Boundary Detection (SBD), where a video stream will be separated into a set of representative key-frames. SBD often serves as a basis for further video analysis tasks. Video Text Detection (Video OCR) is one of the most intense research topics in MIR domain. Here we focus on improving existing approaches by using Deep-Learning techniques. Video Genre Classification is another topic attracted much more attention recently. An approach will be developed based on multimodal video information such as video key-frames,  frame concepts, topics from video texts etc. The last topic is Real-time Video Object Tracking Applications. Various applications can be developed based on an existing object tracking approach, as e.g., interactive web navigation using object tracking algorithm.

Voraussetzungen

  • Strong interests in video/image processing, machine learning and/or computer vision

  • Software development in C/C++

  • Experience with OpenCV and machine learning applications as a plus

Literatur

  • Haojin Yang, Bernhard Quehl and Harald Sack, "A Framework for Improved Video Text Detection and Recognition", International Journal of MULTIMEDIA TOOLS AND APPLICATIONS (MTAP), special issue "Computer Vision for Multimedia", Volume 69 Number 1, pp 217-245. Publicher: Springer US, DOI: http://dx.doi.org/10.1007/s11042-012-1250-6, 2014

  • Epshtein, B.; Ofek, E.; Wexler, Y., "Detecting text in natural scenes with stroke width transform," Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , vol., no., pp.2963,2970, 13-18 June 2010 doi: 10.1109/CVPR.2010.5540041

  • Tao Wang; Wu, D.J.; Coates, A; Ng, AY., "End-to-end text recognition with convolutional neural networks," Pattern Recognition (ICPR), 2012 21st International Conference on , vol., no., pp.3304,3308, 11-15 Nov. 2012

  • Andrej Karpathy* (Stanford), Sanketh Shetty (Google), George Toderici (Google), Rahul Sukthankar (Google), Thomas Leung (Google), Li Fei-Fei (Stanford University), “Large-scale Video Classification using Convolutional Neural Networks”, Int. Conference on Computer Vision and Pattern Recognition (CVPR ) 2014

  • Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I; Meinedo, H.; Bugalho, M.; Trancoso, I, "Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features," Circuits and Systems for Video Technology, IEEE Transactions on , vol.21, no.8, pp.1163,1177, Aug. 2011 doi: 10.1109/TCSVT.2011.2138830

  • Kalal, Z.; Mikolajczyk, K.; Matas, J., "Tracking-Learning-Detection," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.34, no.7, pp.1409-1422, July 2012 doi: 10.1109/TPAMI.2011.239

Leistungserfassung

The final evaluation will be based on:

  • Initial implementation / idea presentation, 10%

  • Final presentation, 20%

  • Report/Documentation, 12-18 pages, 30%

  • Implementation, 40%

  • Participation in the seminar (bonus points)

Termine

Monday, 11.00-12.30

Room A-1.1

13.10.2014 und 20.10.2014 11:00-12:00

Vorstellung der Themen

24.10.2014 bis 23:59

Wahl der Themen

27.10.2014

Bekanntgabe der Themen- und Gruppenzuordnung

wöchentlich

Individuelle Meetings mit dem Betreuer

Anfang Dezember

Technologievorträge und geführte Diskussion (je 15+5min)

02.02.2015

Präsentation der Endergebnisse (je 15+5min)

08.02.2015 bis 23:59

Abgabe von Implementierung und Dokumentation

bis Ende Februar

Bewertung der Leistungen

Zurück