Data Analysis using R, 1 - Statistical Epidemiology (Wintersemester 2019/2020)
Dozent:
Dr. rer. nat. Stefan Konigorski
(Digital Health - Machine Learning)
Website zum Kurs:
https://moodle2.uni-potsdam.de/course/index.php?categoryid=1532
Allgemeine Information
- Semesterwochenstunden: 4
- ECTS: 6
- Benotet:
Ja
- Einschreibefrist: 01.10. - 30.10.2019
- Lehrform: Vorlesung / Seminar
- Belegungsart: Wahlpflichtmodul
- Lehrsprache: Englisch
- Maximale Teilnehmerzahl: 30
Studiengänge, Modulgruppen & Module
- OSIS: Operating Systems & Information Systems Technology
- HPI-OSIS-K Konzepte und Methoden
- OSIS: Operating Systems & Information Systems Technology
- HPI-OSIS-S Spezialisierung
- OSIS: Operating Systems & Information Systems Technology
- HPI-OSIS-T Techniken und Werkzeuge
- IT-Systems Engineering
- IT-Systems Engineering
- IT-Systems Engineering
- IT-Systems Engineering
- DATA: Data Analytics
- HPI-DATA-K Konzepte und Methoden
- DATA: Data Analytics
- HPI-DATA-T Techniken und Werkzeuge
- PREP: Data Preparation
- HPI-PREP-K Konzepte und Methoden
- PREP: Data Preparation
- HPI-PREP-T Techniken und Werkzeuge
- APAD: Acquisition, Processing and Analysis of Health Data
- HPI-APAD-C Concepts and Methods
- APAD: Acquisition, Processing and Analysis of Health Data
- HPI-APAD-T Technologies and Tools
- APAD: Acquisition, Processing and Analysis of Health Data
- HPI-APAD-S Specialization
- SCAD: Scalable Computing and Algorithms for Digital Health
- HPI-SCAD-C Concepts and Methods
- SCAD: Scalable Computing and Algorithms for Digital Health
- HPI-SCAD-S Specialization
- SCAD: Scalable Computing and Algorithms for Digital Health
- HPI-SCAD-T Technologies and Tools
- DICR: Digitalization of Clinical and Research Processes
- HPI-DICR-C Concepts and Methods
- DICR: Digitalization of Clinical and Research Processes
- HPI-DICR-S Specialization
- DICR: Digitalization of Clinical and Research Processes
- HPI-DICR-T Technologies and Tools
Beschreibung
This course teaches biostatistical methods and their application for data analysis of large epidemiological datasets using the statistical software R (www.r-project.org) and the graphical interface RStudio (www.rstudio.com). To this aim, the class starts with an introduction to R and RStudio. R Markdown will be used as a tool for documentation and reporting of the analysis results. Next, the class covers data processing steps and introduces epidemiological study designs as well as theoretical and practical aspects of basic and more advanced biostatistical methods. In addition to classical biostatistical approaches such as linear and linear mixed models, newer methods how to deal with missing values, how to perform meta analyses, and for causal inference will be discussed and applied.
Content:
- Introduction to R, RStudio
- Documentation and report writing using R Markdown
- Data setup: create, import, export datasets in R
- Format datasets in R: transform variables and manipulate datasets
- Descriptive statistics
- Tables and graphics to visualize data and results
- Epidemiological study designs and study planning
- Introduction to statistical parameter estimation and hypothesis testing
- Statistical methods for dealing with missing values
- Linear regression and regression models for binary and count data
- Linear mixed models for the analysis of clustered and longitudinal data
- Meta analysis
- Survival analysis
- Optional: Statistical methods for causal inference
Learning goals:
At the end of the course, the students will be able to
- understand the main concepts of basic and more advanced biostatistical methods and select appropriate methods for data analysis of epidemiological studies
- import and manipulate datasets in R for statistical analysis
- perform the data analysis in R considering measurement error and missing values
- document the analysis and report the results using R Markdown
Voraussetzungen
Already at first appointment bring your own laptop with R and RStudio installation.
Needed installations:
Help for the installation can be found e.g. at http://r-tutorial.nl/, or in the first two sessions of Prof. Arnrich's Bridgemodule course (Fundamentals of Programming, October 14 + 15)).
Lern- und Lehrformen
Lectures, practical exercises, group exercises
Leistungserfassung
Condition for admission to final exam:
(i) Hand in solutions to 10 of the 12 weekly assignments
(ii) Pass midterm exam
Final grade:
(i) Multiple choice open book midterm exam (20%)
(ii) Open book take home final exam (80%)
Termine
Course starts October 21!
Course is on Mondays 9:15-12:30h in HE 51 Find here the timeline of the course
NOTICE: on Monday 12/09/19 we are in H.2.57!
Zurück