Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in RL
Reinforcement Learning Day 2019:
Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in Reinforcement Learning
- 日期:
- 演讲者:
- Sheila Mcllraith
- 所属机构:
- University of Toronto