Hasso-Plattner-InstitutSDG am HPI
Hasso-Plattner-InstitutDSG am HPI

NLP and Computer Vision with Deep Learning Research Seminar (Wintersemester 2023/2024)

Lecturer: Prof. Dr. Gerard de Melo (Artificial Intelligence and Intelligent Systems) , Jingyi Zhang (Artificial Intelligence and Intelligent Systems)
Course Website: https://moodle.hpi.de/course/view.php?id=670

General Information

  • Weekly Hours: 4
  • Credits: 6
  • Graded: yes
  • Enrolment Deadline: 01.10.2023 - 31.10.2023
  • Examination time §9 (4) BAMA-O: 22.12.2023
  • Teaching Form: Project seminar
  • Enrolment Type: Compulsory Elective Module
  • Course Language: English

Programs, Module Groups & Modules

IT-Systems Engineering MA
Data Engineering MA
Digital Health MA
Software Systems Engineering MA
  • DSYS: Data-Driven Systems
    • HPI-DSYS-C Concepts and Methods
  • DSYS: Data-Driven Systems
    • HPI-DSYS-T Technologies and Tools
  • DSYS: Data-Driven Systems
    • HPI-DSYS-S Specialization
  • MALA: Machine Learning and Analytics
    • HPI-MALA-C Concepts and Methods
  • MALA: Machine Learning and Analytics
    • HPI-MALA-T Technologies and Tools
  • MALA: Machine Learning and Analytics
    • HPI-MALA-S Specialization


Deep Learning is the foundation for most modern approaches to AI, especially for natural language processing (NLP) and computer vision (CV). Recent advances such as Transformers, CLIP, ChatGPT show the effectivess of  deep learning solutions in tackling many complex single modality and multimodal problems.

This seminar aims to build upon recent research in NLP, CV, and deep learning.

Potential topics that could be explored in this seminar include Transformer models for NLP, vision–and–language models, startup success prediction, etc.


The main focus is on research, so depending on the topic some prior familiarity with ML/AI, especially Deep Learning, is probably needed. For many topics, you will need some prior experience with either PyTorch, Tensorflow, or Jax, and prior experience in training deep neural networks with GPUs. For example, you can take our "Natural Language Processing" course, Christoph Lippert's Deep Learning course, or Dagmar Kainmüller's computer vision course to acquire the prerequisite knowledge.


This seminar focuses on practical research skills. Depending on the topic, you can either investigate it alone or in a team. Students will work on these projects throughout the semester, supported by weekly meetings with their mentor.


The grade will be based on the following:

  • 25% Final Presentation
  • 75% Project (7 to 10-Page Paper and Code Submission)


Important criteria for the evaluation of the project include the project effort, the quality of the paper, and the reproducibility of the code. Further details will be given during the seminar.


Potential topics are presented on the first day (Tuesday, Oct. 17, 15:15-16:45 in HPI Building K, room K-2.04).

There will be a mid-term presentation during the semester in addition to a final presentation at the end of the semester.