Eligibility traces
Thoughts
- 2021 DeepMind x UCL RL Lecture Series - Model-free Prediction
[5/13]
, 1:27:09 - Rewrite MC error as a sum of TD errors, rearrange double sum in gradient of weights in linear function approximation case.
[5/13]
, 1:27:09 - Rewrite MC error as a sum of TD errors, rearrange double sum in gradient of weights in linear function approximation case.Created: 2022-03-13 Sun 21:44