Panel: Generalization in reinforcement learning
The ability for a reinforcement learning (RL) policy to generalize is a key requirement for the broad application of RL algorithms. This generalization ability is also essential to the future of RL—both in theory and in practice. Join Microsoft researchers Harm van Seijen, Cheng Zhang, and Mingfei Sun, along with Dr. Wendelin Boehmer from Delft University of Technology and Dr. Roberta Raileanu from New York University, as they examine how agents struggle to transfer learned policies to new environments or tasks and explore why generalization remains challenging for state-of-the-art deep RL algorithms. In addition, they will discuss open questions about the right way to think about generalization in RL, the right way to formalize the problem, and the most important tasks to be considered for generalization. Together, you will explore the importance of studying generalization in RL, the recent research progress in generalization in RL, the open challenges, and the potential research directions in this area.
Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit (opens in new tab)
- 轨迹:
- Reinforcement Learning
- 日期:
- 演讲者:
- Mingfei Sun, Roberta Raileanu, Wendelin Böhmer, Harm van Seijen, Cheng Zhang
- 所属机构:
- Microsoft Research Cambridge, NYU, Delft University of Technology, Microsoft Research Montreal, Microsoft Research Cambridge
-
-
Mingfei Sun
Researcher
-
Roberta Raileanu
Research Intern
-
Harm van Seijen
Principal Research Manager
-
Cheng Zhang
Principal Researcher
-
Wendelin Böhmer
Assistant Professor
Delft University of Technology
-
-
Reinforcement Learning
-
-
-
Research talk: Reinforcement learning with preference feedback
Speakers:- Aadirupa Saha
-
-
-
-
-
-
Panel: Generalization in reinforcement learning
Speakers:- Mingfei Sun,
- Roberta Raileanu,
- Harm van Seijen
-
-
Research talk: Successor feature sets: Generalizing successor representations across policies
Speakers:- Kiante Brantley
-
Research talk: Towards efficient generalization in continual RL using episodic memory
Speakers:- Mandana Samiei
-
Research talk: Breaking the deadly triad with a target network
Speakers:- Shangtong Zhang
-
Panel: The future of reinforcement learning
Speakers:- Geoff Gordon,
- Emma Brunskill,
- Craig Boutilier
-