Model-Free RL

about | blog | config | notes | github

A paradigm of Reinforcement Learning that makes use of only a Policy Function and Value Function that does not concern itself with learning the dynamics of the RL Environment with an internal Environment Model.

Created: 2021-11-13

Emacs 26.1 (Org mode 9.5)