Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming

Speaker: Steven L. Brunton
-/27:09
Ask the seminar a question! BETA
Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming
Prof. Steven L. Brunton
Steven L. Brunton
University of Washington

Here we introduce dynamic programming, which is a cornerstone of model-based reinforcement learning. We demonstrate dynamic programming for policy iteration and value iteration, leading to the quality function and Q-learning.

References
  • 1.
    https://openlibrary.org/isbn/9781108422093
Brunton Lab logo
Reinforcement learning
Brunton Lab (University of Washington)
Cite as
S. L. Brunton (2022, January 7), Model Based Reinforcement Learning: Policy Iteration, Value Iteration, and Dynamic Programming
Share
Details
Listed seminar This seminar is open to all
Recorded Available to all
Video length 27:09