Microsoft Research blog
Finding the best learning targets automatically: Fully Parameterized Quantile Function for distributional RL
Li Zhao
Reinforcement learning has achieved great success in game scenarios, with RL agents beating human competitors in such games as Go and poker. Distributional reinforcement learning, in particular, has proven to be an effective approach for training an agent to maximize…