Lecturer

Teaching Assistant

Abstract

Data arising from business transactions, scientific measurements and other forms of content-creation calls for automatic data mining and pattern recognition techniques that allow us to efficiently make sense of this data. At the same time these techniques should be able to handle uncertainty, as data from measurements may be imprecise and user-generated content may be unreliable. This lecture will introduce the main concepts of data mining and machine learning, ranging from basic probability and information theory to popular classification, clustering, and regression algorithms.

Prerequisites

Although we will review basic probability theory and statistics, prior knowledge in these areas are useful. Further, we will make heavy use of linear algebra and a fundamental understanding thereof is necessary.

Literature

P. Flach: Machine Learning - The Art and Science of Algorithms that make Sense of Data (Chapters 1 - 3, 5 - 12)
K. Murphy: Machine Learning - A Probabilistic Perspective (Chapters 1 - 8, 10 - 12, 19, 27)
C. Bishop: Pattern Recognition and Machine Learning (Chapters 1 - 4, 8, 9)

Timetable

Lectures:
- Monday 13:30 in Room D-E.9/10
- Every second Wednesday 13:30 in Room D-E.9/10 starting 15.4.15
- Slides will be available in the "Materialien" folder in the "Interner Bereich"
Exercises:
- Every other second Wednesday 13:30 in Room D-E.9/10 starting 22.4.15
- The homework assignments are available in the "Materialien" folder in the "Interner Bereich"

	Chapter	Date	Topic	Literature
1	Introduction	13.4.15	What is Data Mining?
		15.4.15	Binary Classification
		20.4.15	Beyond Binary Classification
		22.4.15	1. Exercise Introduction
2	Basics	27.4.15	Introduction to Statistics
		29.4.15	Explorative Data Analysis
		4.5.15	Features
		6.5.15	2. Exercise Basics
		11.5.15	Experiments
3	Logical Models	13.5.15	Tree Models	decisionTrees.pdf
		18.5.15	Rule Models	associationRules.pdf
		20.5.15	3. Exercise Logical Models
		25.5.15	Holiday
4	Geometric Models	27.5.15	Linear Models I	ridgeRegression.pdf
		1.6.15	Linear Models II	svm.pdf
		3.6.15	Distance-based Models	kMeans.pdf
		8.6.15	4. Exercise Geometric Models
5	Probabilistic Models	10.6.15	Introduction to Probability Theory	naiveBayes.pdf
		15.6.15	Gaussian Models	kernelLda.pdf
		17.6.15	Linear Models III	generalizedLinearModels.pdf
		22.6.15	5. Exercise Probabilistic Models
6	Graphical Models	24.6.15	Bayes Nets	bayesNets.pdf
		29.6.15	Markov Random Fields	crf.pdf
		1.7.15	No Lecture
		6.7.15	Mixture Models and EM	em.pdf
		8.7.15	Topic Models	lda.pdf
		13.7.15	6. Exercise Graphical Models
7	Combining Models	15.7.15	Ensembles	adaBoost.pdf
		20.7.15	Artificial Neural Networks	deepLearning.pdf
		22.7.15	7. Exercise Combining Models

Grading

There will be an oral exam at the end of the term.

Condition for admission is successful homework and exercise participation.