101.789 AKNUM Reinforcement Learning

2022S, VU, 4.0h, 6.0EC

TUWEL course

Properties

Semester hours: 4.0
Credits: 6.0
Type: VU Lecture and Exercise
Format: Hybrid

Learning outcomes

After successful completion of the course, students are able not only to understand, explain, and apply the theory and methods of reinforcement learning, including the latest developments, but also to implement the most important algorithms.

Subject of course

This year (2022), the course comes with updated lecture notes!

Reinforcement learning is a field of artificial intelligence concerned with developing strategies that an agent uses to maximize its reward in a random environment in a model-free manner, that is, without relying on a model of the environment.

Applications include robotics (OpenAI Gym), computer vision, games played at or above the human level (such as Go, chess, Atari 2600, and Dota 2), and many more.

Theory and algorithms of reinforcement learning:

Introduction
Bandit problems
Markov decision problems
Bellman equations
Hamilton-Jacobi-Bellman equation
Dynamic programming
Monte-Carlo learning
Temporal-difference learning
Tabular methods
Function approximation and deep learning
On-policy vs. off-policy
Eligibility traces
Policy gradients and actor-critic
Applications

In the tutorial, the theory will be reviewed and extended, and the algorithms will be implemented.

Teaching methods

Presentation, lecture notes, tutorial.

Mode of examination

Written

Additional information

The time of the first meeting will be announced.

The class will be taught in person, streamed, and recorded.

Lecturers

Heitzinger, Clemens

Institute

E101 Institute of Analysis and Scientific Computing

Course dates

Day	Time	Date	Location	Description
Tue	13:00 - 15:00	01.03.2022 - 28.06.2022	Sem.R. DB gelb 03	Reinforcement Learning
Wed	13:00 - 15:00	02.03.2022 - 29.06.2022	Sem.R. DB gelb 03	Reinforcement Learning

AKNUM Reinforcement Learning - Single appointments

Day	Date	Time	Location	Description
Tue	01.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	02.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	08.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	09.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	15.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	16.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	22.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	23.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	29.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	30.03.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	05.04.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	06.04.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	26.04.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	27.04.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	03.05.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	04.05.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	10.05.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	11.05.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Tue	17.05.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning
Wed	18.05.2022	13:00 - 15:00	Sem.R. DB gelb 03	Reinforcement Learning

Examination modalities

Continuous assessment in the tutorials; written tests.

Course registration

Begin	End	Deregistration end
07.03.2022 00:00	13.04.2022 00:00	13.04.2022 00:00

Curricula

Study Code	Obligation	Semester	Precon.	Info
066 645 Data Science	Mandatory elective
860 GW Optional Courses - Technical Mathematics	Not specified

Literature

Lecture notes (in English) will be handed out.

Previous knowledge

The theoretical aspects will be explained in the lectures in a self-contained manner so that the course can be taken during or after the fourth semester.

Miscellaneous

Course homepage

Language

In English if required