Hasso-Plattner-Institut
  
    • de
 

Cheng Wang

Hasso Plattner Institute, University of Potsdam

Prof.-Dr.-Helmert-Str. 2-3

D-14482 Potsdam

Germany

Room: H-1.22

Tel: +49(0)331 5509-546

emal: cheng.wang(at)hpi.de

 

Research

I am interested in Deep Learning, Computer Vision and Multimedia Analysis. My PhD research topic is "Deep Learning for Multimodal Data Understanding"

Publications

In Journals

  • Cheng Wang, Haojin Yang and Christoph Meinel, "Image Captioning with Deep Bidirectional LSTMs and Multi-task Learning"; ( ACM Transactions on Multimedia Computing, Communications and Applications (TOMM) ) (Under Review)
  • Cheng Wang, Haojin Yang and Christoph Meinel, "Deep Metric Learning of Video Representation using Siamese Neural Network" (IET Computer Vision) (Under Review)
  • Cheng Wang, Haojin Yang and Christoph Meinel, "A Deep Semantic Framework for Multimodal Representation Learning", Multimedia Tools and Applications (MTAP) (Impact Factor: 1.331), DOI: 10.1007/s11042-016-3380-8, online ISSN:1573-7721, Print ISSN:1380-7501,  Special Issue: "Representation Learning for Multimedia Data Understanding", March 2016 [link] [PDF] [BibTex]

In Conferences

  • Haojin Yang, Cheng Wang, Christian Bartz, Christoph Meinel "SceneTextReg: A Real-Time Video OCR System", ACM international conference on Multimedia (ACM MM 2016), system demonstration, 15-19 October 2016, Amsterdam, The Netherlands [demo video
  • Cheng Wang, Haojin Yang, Christian Bartz, Christoph Meinel "Image Captioning with Deep Bidirectional LSTMs", ACM international conference on Multimedia (ACM MM 2016), Full paper, Oral Presentation, 15-19 October 2016, Amsterdam, The Netherlands [PDF copy] [demo video
  • Xiaoyin Che, Cheng Wang, Haojin Yang and Christoph Meinel, "Punctuation Prediction for Unsegmented Transcript Based on Word Vector", "the 10th International Conference on Language Resources and Evaluation (LREC 2016)", Portorož (Slovenia), 23-28 May 2016
  • Cheng Wang, Haojin Yang and Christoph Meinel, "Exploring Multimodal Video Representation for Action Recognition", the annual International Joint Conference on Neural Networks (IJCNN 2016), Vancouver, Canada, July 24-29, 2016 (to appear)
  • Sheng Luo, Haojin Yang, Cheng Wang, Xiaoyin Che, and Christoph Meinel, "Action Recognition in Surveillance Video Using ConvNets and Motion History Image", International Conference on Artificial Neural Networks (ICANN 2016), Barcelona Spain, 6th-9th of September 2016 (to appear)
  • Sheng Luo, Haojin Yang, Cheng Wang, Xiaoyin Che and Christoph Meinel, "Real-time action recognition in surveillance videos using ConvNets", in the 23rd International Conference on Neural Information Processing (ICONIP 2016), in Kyoto (Japan), 16th – 12th of October 2016 (to appear)
  • Haojin Yang, Cheng Wang, Xiaoyin Che, Sheng Luo and Christoph Meinel. “An Improved System For Real-Time Scene Text Recognition”, ACM International Conference on Multimedia Retrieval (ICMR 2015), system demonstration session, Shanghai, June 23-26, 2015
  • Cheng Wang, Haojin Yang and Christoph Meinel, "Does Multilevel Semantic Representation Improve Text Categorization?", the 26th International Conference on Database and Expert Systems Applications (DEXA 2015), Valencia, Spain, September 1-4, 2015
  • Cheng Wang, Haojin Yang and Christoph Meinel, "Visual-Textual Late Semantic Fusion Using Deep Neural Network for Document Categorization",  the 22nd International Conference on Neural Information Processing (ICONIP2015), Istanbul, Turkey, November 9-12, 2015
  • Cheng Wang, Haojin Yang, Christoph Meinel, "Deep Semantic Mapping for Cross-Modal Retrieval",  the 27th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2015), Vietri sul Mare, Italy, November 9-11, 2015
  • Cheng Wang, Haojin Yang, Xiaoyin Che and Christoph Meinel, "Concept-Based Multimodal Learning for Topic Generation", the 21st MultiMedia Modelling Conference (MMM2015), Sydney, Australia, Jan 5-7, 2015

Projects

  • Automatic Image Descriptions Generation
  • Human Action Recognition in Videos
  • Multimodal/Cross-modal Retrieval

Teaching and Mentoring

  • Summer Semester 2016: Video Captioning with Deep Learning
  • Winter Semester 2015/2016: Languages Identification with Deep Learning
  • Summer Semester 2015: Deep Learning for Video Classification