
Applied Probabilistic Machine Learning (Wintersemester 2022/2023)

Lecturers: Prof. Dr. Bernhard Renard (Data Analytics and Computational Statistics), Hugues Richard (Data Analytics and Computational Statistics), Jens-Uwe Ulrich (Data Analytics and Computational Statistics)
Course Website: https://moodle.hpi.de/course/view.php?id=222

General Information

  • Weekly Hours: 4
  • Credits: 6
  • Graded: yes
  • Enrolment Deadline: 01.10.2022 - 31.10.2022
  • Examination time §9 (4) BAMA-O: 25.11.2022
  • Teaching Form: Seminar
  • Enrolment Type: Compulsory Elective Module
  • Course Language: English
  • Maximum number of participants: 20

Programs, Module Groups & Modules

IT-Systems Engineering MA
Data Engineering MA
Digital Health MA
  • SCAD: Scalable Computing and Algorithms for Digital Health
    • HPI-SCAD-C Concepts and Methods
    • HPI-SCAD-T Technologies and Tools
    • HPI-SCAD-S Specialization
  • APAD: Acquisition, Processing and Analysis of Health Data
    • HPI-APAD-C Concepts and Methods
    • HPI-APAD-T Technologies and Tools
    • HPI-APAD-S Specialization
Cybersecurity MA
Software Systems Engineering MA

Description

In all areas of health and life science, large amounts of data are generated. Analysing, visualising and making predictions from these data require dedicated procedures to be designed and implemented; this is the first step towards prediction and inference for decision making.

Computational statistical methods have evolved to cope with challenges arising from large datasets that are not tractable with traditional approaches, e.g. when the dimensionality of the data far exceeds the number of observations.

In this course we will learn to use probabilistic models as a principled tool to describe the dependencies between the variables of a system while accounting for uncertainty. Whereas highly efficient machine learning strategies often work as black boxes, explicit probabilistic models allow users to preserve explainability while still solving a wide range of problems.

We will see how inference with those models can be cast in a practical framework for typical machine learning tasks such as classification or clustering. The course will focus on graphical models, a class of models that capture conditional dependencies between variables and have proven useful for many applications (document classification, gene detection, ...). These models will be applied to realistic case studies through small projects.
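
As an informal illustration (not part of the official course material), the following minimal sketch shows how the conditional-independence assumption behind one such graphical model, a naive Bayes classifier, can be used for document classification. The vocabulary, word counts and class labels are made-up toy values.

    # Naive Bayes document classification: word occurrences are assumed
    # conditionally independent given the document class (toy data only).
    import numpy as np

    # Toy corpus: rows = documents, columns = counts of 3 vocabulary words
    X = np.array([[3, 0, 1],
                  [2, 1, 0],
                  [0, 4, 2],
                  [1, 3, 3]])
    y = np.array([0, 0, 1, 1])  # class label of each document

    classes = np.unique(y)
    # Class priors P(c) and per-class word distributions P(w | c), Laplace-smoothed
    priors = np.array([np.mean(y == c) for c in classes])
    word_probs = np.array([
        (X[y == c].sum(axis=0) + 1) / (X[y == c].sum() + X.shape[1])
        for c in classes
    ])

    def predict(counts):
        # log P(c | doc) is proportional to log P(c) + sum_w counts[w] * log P(w | c)
        log_post = np.log(priors) + counts @ np.log(word_probs).T
        return classes[np.argmax(log_post)]

    print(predict(np.array([2, 0, 1])))  # predicts class 0 for this toy document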

Learning Objectives:

  • Understand the typical variants of graphical models (mixture models, Markov and hidden Markov models, Bayesian networks, topic models, probabilistic PCA).
  • Understand techniques for evaluating the fit of a model and inferring its parameters (optimisation and the EM algorithm; a sketch follows this list).
  • Apply these models to machine learning tasks (supervised/unsupervised learning).
  • Understand and analyse the algorithmic challenges underlying model training.
  • Be able to design and train a model for a given task.
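
As a flavour of the second objective, the sketch below runs the EM algorithm for a two-component one-dimensional Gaussian mixture. The synthetic data, initial parameter guesses and iteration count are arbitrary choices for illustration only, not course material.

    # EM for a two-component 1-D Gaussian mixture on synthetic data.
    import numpy as np

    rng = np.random.default_rng(0)
    x = np.concatenate([rng.normal(-2, 1, 200), rng.normal(3, 1, 200)])

    # Initial guesses for mixing weights, means and variances
    weights = np.array([0.5, 0.5])
    mu = np.array([-1.0, 1.0])
    var = np.array([1.0, 1.0])

    for _ in range(50):
        # E-step: responsibilities r[n, k] = P(component k | x_n)
        dens = np.exp(-(x[:, None] - mu) ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
        r = weights * dens
        r /= r.sum(axis=1, keepdims=True)

        # M-step: re-estimate parameters from the responsibilities
        Nk = r.sum(axis=0)
        weights = Nk / len(x)
        mu = (r * x[:, None]).sum(axis=0) / Nk
        var = (r * (x[:, None] - mu) ** 2).sum(axis=0) / Nk

    print(mu)  # close to the true component means -2 and 3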

Requirements

  • Good knowledge of calculus (functional optimisation) and linear algebra is recommended
  • Knowledge of key statistical concepts (a refresher will be provided in the first lectures)
  • Basic knowledge of Python or R, or very good skills in another programming language
  • The course will be given in English

Literature

The course will be mainly based on the following book, which is available online:

  • Bishop, Pattern Recognition and Machine Learning, Springer

The following books will be used for complementary information:

  • Duda, Hart and Stork, Pattern Classification
  • Murphy, Machine Learning: A Probabilistic Perspective
  • Koller and Friedman, Probabilistic Graphical Models

Examination

  • Submission of the small projects (50% of final grade)
  • Presentation of the small projects (at least one per semester) (40% of final grade)
  • Student participation (10% of final grade)
  • Students have to give a talk about their solution to an exercise at least once during the semester
