Praktische Anwendung von Video Analyse Technologien (Wintersemester 2014/2015)

Dozent: Prof. Dr. Christoph Meinel (Internet-Technologien und -Systeme) , Dr. Haojin Yang (Internet-Technologien und -Systeme)

Allgemeine Information

Semesterwochenstunden: 4
ECTS: 6
Benotet: Ja
Einschreibefrist: 24.10.2014
Lehrform: Seminar
Belegungsart: Wahlpflichtmodul
Maximale Teilnehmerzahl: 20

Studiengänge, Modulgruppen & Module

IT-Systems Engineering BA

Beschreibung

In the last decade digital libraries and web video portals have become more and more popular. The amount of video data available on the World Wide Web (WWW) is growing rapidly. According to the official statistic-report of the popular video portal YouTube more than 6 billion hours of video are watched each month and about 100 hours of video are uploaded every minute. Therefore, how to efficiently retrieve video data on the web or within large video archives has become a very important and challenging task.

In this seminar, various methods for automatic video analysis and retrieval will be developed based on state-of-the-art computer vision technologies. The system accuracy and performance will be evaluated by using opened benchmark.

More information

In our current research, we focus on state-of-the-art techniques on video analysis and multimedia information retrieval (MIR). Potential topics include video Shot Boundary Detection (SBD), where a video stream will be separated into a set of representative key-frames. SBD often serves as a basis for further video analysis tasks. Video Text Detection (Video OCR) is one of the most intense research topics in MIR domain. Here we focus on improving existing approaches by using Deep-Learning techniques. Video Genre Classification is another topic attracted much more attention recently. An approach will be developed based on multimodal video information such as video key-frames, frame concepts, topics from video texts etc. The last topic is Real-time Video Object Tracking Applications. Various applications can be developed based on an existing object tracking approach, as e.g., interactive web navigation using object tracking algorithm.

Voraussetzungen

Strong interests in video/image processing, machine learning and/or computer vision
Software development in C/C++
Experience with OpenCV and machine learning applications as a plus

Literatur

Haojin Yang, Bernhard Quehl and Harald Sack, "A Framework for Improved Video Text Detection and Recognition", International Journal of MULTIMEDIA TOOLS AND APPLICATIONS (MTAP), special issue "Computer Vision for Multimedia", Volume 69 Number 1, pp 217-245. Publicher: Springer US, DOI: http://dx.doi.org/10.1007/s11042-012-1250-6, 2014
Epshtein, B.; Ofek, E.; Wexler, Y., "Detecting text in natural scenes with stroke width transform," Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on , vol., no., pp.2963,2970, 13-18 June 2010 doi: 10.1109/CVPR.2010.5540041
Tao Wang; Wu, D.J.; Coates, A; Ng, AY., "End-to-end text recognition with convolutional neural networks," Pattern Recognition (ICPR), 2012 21st International Conference on , vol., no., pp.3304,3308, 11-15 Nov. 2012
Andrej Karpathy* (Stanford), Sanketh Shetty (Google), George Toderici (Google), Rahul Sukthankar (Google), Thomas Leung (Google), Li Fei-Fei (Stanford University), “Large-scale Video Classification using Convolutional Neural Networks”, Int. Conference on Computer Vision and Pattern Recognition (CVPR ) 2014
Sidiropoulos, P.; Mezaris, V.; Kompatsiaris, I; Meinedo, H.; Bugalho, M.; Trancoso, I, "Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features," Circuits and Systems for Video Technology, IEEE Transactions on , vol.21, no.8, pp.1163,1177, Aug. 2011 doi: 10.1109/TCSVT.2011.2138830
Kalal, Z.; Mikolajczyk, K.; Matas, J., "Tracking-Learning-Detection," Pattern Analysis and Machine Intelligence, IEEE Transactions on , vol.34, no.7, pp.1409-1422, July 2012 doi: 10.1109/TPAMI.2011.239

Leistungserfassung

The final evaluation will be based on:

Initial implementation / idea presentation, 10%
Final presentation, 20%
Report/Documentation, 12-18 pages, 30%
Implementation, 40%
Participation in the seminar (bonus points)

Termine

Monday, 11.00-12.30

Room A-1.1

13.10.2014 und 20.10.2014 11:00-12:00	Vorstellung der Themen
24.10.2014 bis 23:59	Wahl der Themen
27.10.2014	Bekanntgabe der Themen- und Gruppenzuordnung
wöchentlich	Individuelle Meetings mit dem Betreuer
Anfang Dezember	Technologievorträge und geführte Diskussion (je 15+5min)
02.02.2015	Präsentation der Endergebnisse (je 15+5min)
08.02.2015 bis 23:59	Abgabe von Implementierung und Dokumentation
bis Ende Februar	Bewertung der Leistungen

Zurück