Bellman Equations and Dynamic Programming

Introduction to Reinforcement Learning

Part 6: Core Theory II:

Bellman Equations and Dynamic Programming

Bellman Equations

Recursive relationships among values that can be used to compute values

The tree of transition dynamics

a path, or trajectory

state

action

possible path

The web of transition dynamics

a path, or trajectory

state action possible path

The web of transition dynamics

state action possible path

backup diagram

................
................

In order to avoid copyright disputes, this page is only a partial summary.

Google Online Preview   Download