Bellman Equations

For Markov Decision Processes (MDPs), the Bellman equations are as follows.

Thoughts

  • Solving MDPs via the Bellman equations assume that we know the full environment dynamics, we have computational resources and the Markov property, p66 (Sutton and Barto 2018).

Author: Nazaal

Created: 2022-03-13 Sun 21:44

Validate