Data Mining and Probabilistic Reasoning
Lecturer
Abstract
Data arising from business transactions, scientific measurements and other forms of content-creation calls for automatic data mining and pattern recognition techniques that allow us to efficiently make sense of this data. At the same time these techniques should be able to handle uncertainty, as data from measurements may be imprecise and user-generated content may be unreliable.
This lecture will introduce the main concepts of data mining and probabilistic reasoning, ranging from basic probability and information theory to popular classification and clustering algorithms. An introduction to the exciting area of graphical models and probabilistic inference will highlight the link between uncertainty and probabilistic learning models.
Topics
raphical models
Literature
I. H. Witten, E. Frank, M. A. Hall: Data Mining - Practical Machine Learning Tools and Techniques (Chapters 1 - 6)
C. Bishop: Pattern Recognition and Machine Learning (Chapters 1 - 4, 8, 9)
T. M. Mitchell: Machine Learning (Chapters 3 - 6, 8, 10)
P. Flach: Machine Learning – The Art and Science of Algorithms that make Sense of Data (Chapters 1 – 3, 5 – 11)
D. J. C. MacKay: Information Theory, Inference and Learning Algorithms (Chapters 1 - 6)
Tools
Timetable
- Lectures:
- Tuesdays 13:30-15:00 in Room H-E.51
- Every second Thursday 11:00-12:30 in Room H-2.57
- Exercises:
- Every second Thursday 11:00-12:30 in Room H-2.57
- The assignments are available in the "Materialien" folder in the "Interner Bereich"
We thank all students for their help in pointing out errors in the slides!
The exam timetable and room information are now available in the "Interner Bereich"
Date | Topic | |
|---|---|---|
| 15.10.2013 | Introduction & examples | pdf (final) |
| 17.10.2013 | Basics of probability theory | pdf (final, Version 2 20.11.2013) |
| 22.10.2013 | Basics of statistics (part I) | pdf (final, Version 2 12.11.2013) |
| 24.10.2013 | Exercise 1 | available in "Interner Bereich" |
| 29.10.2013 | Basics of statistics (part II) | |
| 31.10.2013 | No lecture; public holiday | |
| 05.11.2013 | Basics of information theory | pdf (final) |
| 07.11.2013 | Exercise 2 | available in "Interner Bereich" |
| 12.11.2013 | Introduction to classification | pdf (final) |
| 14.11.2013 | Linear classification models (part I) | pdf (final, Version 2 06.12.2013) |
| 19.11.2013 | Linear classification models (part II) | |
| 21.11.2013 | Exercise 3 | available in "Interner Bereich" |
| 25.11.2013 | Linear classification models (part III) | |
| 28.11.2013 | canceled | |
| 02.12.2013 | Artifical Neural Networks (part I) | pdf (final) |
| 05.12.2013 | Exercise 4 | available in "Interner Bereich" |
| 10.12.2013 | Artifical Neural Networks (part II) | |
| 12.12.2013 | Non-linear classification models (part I) | pdf (final) |
| 17.12.2013 | Non-linear classification models (part II) | |
| 19.12.2013 | Exercise 5 | available in "Interner Bereich" |
| 07.01.2014 | Regression | pdf (final) |
| 09.01.2014 | General clustering algorithms | pdf (final) |
| 14.01.2014 | Clustering: Topic Models (part I) | pdf (final) |
| 16.01.2014 | Clustering: Topic Models (part II) (Note: Moved from Tuesday 21st) | |
| 21.01.2014 | Exercise 6 (Note: Moved from Thursday 16th) | available in "Interner Bereich" |
| 23.01.2014 | Clustering: Topic Models (part III) | |
| 28.01.2014 | Graphical Models (part I) | pdf (final, Version 2 05.02.2014) |
| 30.01.2014 | Exercise 7 | available in "Interner Bereich" |
| 04.02.2014 | Graphical Models (part II), Inference in graphical models | pdf (final) |
| 06.02.2014 | Summary and Exam preparation |
Exam
Condition for exam admission: oral presentation of at least two solutions during the tutorials
Form of exam: oral exam at the end of the term