Responsible aritifical Intelligence (Sommersemester 2023)
Dozent:
Prof. Dr. Holger Giese
(Systemanalyse und Modellierung)
,
Christian Medeiros Adriano
(Systemanalyse und Modellierung)
,
Christian Schäffer
(Systemanalyse und Modellierung)
,
Matthias Barkowsky
(Systemanalyse und Modellierung)
Allgemeine Information
- Semesterwochenstunden: 4
- ECTS: 6
- Benotet:
Ja
- Einschreibefrist: 01.04.2023 - 07.05.2023
- Lehrform: Projektseminar
- Belegungsart: Wahlpflichtmodul
- Lehrsprache: Englisch
Studiengänge, Modulgruppen & Module
- OSIS: Operating Systems & Information Systems Technology
- HPI-OSIS-K Konzepte und Methoden
- OSIS: Operating Systems & Information Systems Technology
- HPI-OSIS-S Spezialisierung
- OSIS: Operating Systems & Information Systems Technology
- HPI-OSIS-T Techniken und Werkzeuge
- SAMT: Software Architecture & Modeling Technology
- HPI-SAMT-K Konzepte und Methoden
- SAMT: Software Architecture & Modeling Technology
- HPI-SAMT-S Spezialisierung
- SAMT: Software Architecture & Modeling Technology
- HPI-SAMT-T Techniken und Werkzeuge
- DANA: Data Analytics
- HPI-DANA-K Konzepte und Methoden
- DANA: Data Analytics
- HPI-DANA-T Techniken und Werkzeuge
- DANA: Data Analytics
- HPI-DANA-S Spezialisierung
- APAD: Acquisition, Processing and Analysis of Health Data
- HPI-APAD-C Concepts and Methods
- APAD: Acquisition, Processing and Analysis of Health Data
- HPI-APAD-T Technologies and Tools
- APAD: Acquisition, Processing and Analysis of Health Data
- HPI-APAD-S Specialization
- OISY: Online and Interactive Systems
- HPI-OISY-C Concepts and Methods
- OISY: Online and Interactive Systems
- HPI-OISY-T Technologies and Tools
- OISY: Online and Interactive Systems
- HPI-OISY-S Specialization
- DSYS: Data-Driven Systems
- HPI-DSYS-C Concepts and Methods
- DSYS: Data-Driven Systems
- HPI-DSYS-T Technologies and Tools
- DSYS: Data-Driven Systems
- HPI-DSYS-S Specialization
- MALA: Machine Learning and Analytics
- HPI-MALA-C Concepts and Methods
- MALA: Machine Learning and Analytics
- HPI-MALA-T Technologies and Tools
- MALA: Machine Learning and Analytics
- HPI-MALA-S Specialization
Beschreibung
Motivation
“Machine learning is an ostensibly technical field, crashing increasingly on human questions. Our human, social, and civic dilemmas are becoming technical. And our technical dilemmas are becoming human, social, and civic. Our successes and failures alike in getting these systems to do ‘what we want’, it turns out, offers an unflinching, revelatory mirror.” - Brian Christian [1]
While machine learning-based systems have presented impressive feats, the industry and governments still face difficult dilemmas to deploy these systems. From the engineering perspective, two imperatives help guide design decisions: the ethical and the explanatory one. Although machines are not humans, machines are expected to obey the ethical norms that regulate human society. Ethics is one of the necessary conditions for responsible behavior, but it is not sufficient, autonomous agents also need to explain their actions or the lack thereof.
This project seminar will contemplate lectures on a set of topics focused on drawing practical lessons and guidance to the future of the engineering of intelligent safety-critical systems.
Topics
Foundations
- Schools of Ethics
- Knowledge and Truth (Gettier cases)
- Ethical dilemmas
Theory
- AI Alignment, Human-Centered AI
- Fairness, Safety, Robustness, Explainability, Accountability
- Safety-Critical Systems
Solutions
- Large Language Models (ChatGPT, GPT4)
- Reinforcement Learning (Safety, Reward modeling, Human-in-the-loop)
- Causal Machine Learning and Counterfactual Reasoning
- Neurosymbolic methods
- Evaluation methods (sensitivity analysis, generalization, transportability, ablation studies, threats to validity)
Literatur
- Christian, B., 2021, The alignment problem: How can machines learn human values?. Atlantic Books.
- Bubeck, S., et al., 2023, Sparks of artificial general intelligence: Early experiments with GPT-4. arXiv preprint arXiv:2303.12712
- Mialon, G., et al., 2023, Augmented language models: a survey. arXiv:2302.07842
- Jojic, A., Wang, Z., & Jojic, N., 2023, GPT is becoming a Turing machine: Here are some ways to program it. arXiv preprint arXiv:2303.14310.
- Mökander, J., et al., 2023, Auditing large language models: a three-layered approach. arXiv:2302.08500
- Mitchell, M., 2021, Why AI is harder than we think. arXiv preprint arXiv:2104.12871
- Brundage, M., et al., 2020, Toward trustworthy AI development: mechanisms for supporting verifiable claims
- Morley, J., et al., 2021, Ethics as a service: a pragmatic operationalisation of AI Ethics. Minds and Machines.
- Xiong, P., et al., 2021, Towards a Robust and Trustworthy Machine Learning System Development.
- Cammarota, R., et al., 2020, Trustworthy AI Inference Systems: An Industry Research View.
- Coeckelbergh, M., 2020, AI Ethics. MIT Press.
- Rudin, C., et al., 2021, Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges
- Karimi, et al., 2021, A survey of algorithmic recourse: contrastive explanations and consequential recommendations.
- Bommasani, R., et al., 2021, On the Opportunities and Risks of Foundation Models.
- Wing, J. 2021, Trust AI, CACM
- IEEE, 2019, The IEEE Global Initiative on Ethics of Autonomous and Intelligent Systems. Ethically Aligned Design: A Vision for Prioritizing Human Well-being with Autonomous and Intelligent Systems, First Edition.
Lern- und Lehrformen
The course is a project seminar, which has an introductory phase comprising initial short lectures. After that, the students will work in groups on jointly identified experiments applying specific solutions to given problems and finally prepare a presentation and write a report about their findings concerning the experiments.
There will be an introductory phase to present basic concepts for the theme, including the necessary foundations.
Lectures will happen through Zoom from our seminar room. The students interested can also join face-to-face in the seminar room.
Leistungserfassung
We will grade the group's reports (80%) and presentations (20%). Note that the report includes documenting the experiments and the obtained results. Therefore, the grading of the report includes the experiments. During the project phase, we will require participation in meetings and other groups' presentations in the form of questions and feedback to their peers.
Termine
The first lecture will take place on April 24, 2023 (Monday). The lectures will take place in room A-1.1 and remotely via Zoom (credentials)*
We will follow the recurrent schedule of:
- Mondays from 9:15-10:45 in room A-1.1
- Tuesdays from 17:00-18:30 in room A-1.1
* In case that you do not have access to GitLab, please email christian.adriano [at] hpi.de
Zurück