Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in RL

Reinforcement Learning Day 2019:
Reward Machines: Structuring Reward Function Specifications and Reducing Sample Complexity in Reinforcement Learning

日期:
演讲者:
Sheila Mcllraith
所属机构:
University of Toronto