Research talk: Making deep reinforcement learning industrially applicable
Deep reinforcement learning has achieved remarkable success, especially in gaming and other applications whose environments are artificial or are associated with low exploration costs. However, for most critical industrial applications, interactions with the environments are very costly—and bad explorations might lead to a disaster. In this situation, a new paradigm of deep reinforcement learning is greatly needed. In this talk, the researchers will introduce a new framework called continual offline reinforcement learning and discuss how to better trade off between policy improvement and global convergence in this framework. They will also discuss how to evaluate an offline learned policy in a more accurate manner before deploying it into real environments. After that, they will introduce several real examples, in which continual offline reinforcement learning was applied to solve difficult problems in the industrial domains of logistics and supply chain. At the end, the researchers will discuss remaining challenges and technical trends in this important space.
Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit (opens in new tab)
- Évènement :
- Microsoft Research Summit 2021
- Piste :
- Reinforcement Learning
- Date:
- Haut-parleurs:
- Jiang Bian, Tie-Yan Liu
- Affiliation:
- Microsoft Research Asia
-
-
Jiang Bian
Senior Principal Research Manager
-
Tie-Yan Liu
Distinguished Scientist, Microsoft Research AI for Science
-
-
Reinforcement Learning
-
-
-
Research talk: Reinforcement learning with preference feedback
Speakers:- Aadirupa Saha
-
-
-
-
-
-
Panel: Generalization in reinforcement learning
Speakers:- Mingfei Sun,
- Roberta Raileanu,
- Harm van Seijen
-
-
Research talk: Successor feature sets: Generalizing successor representations across policies
Speakers:- Kiante Brantley
-
Research talk: Towards efficient generalization in continual RL using episodic memory
Speakers:- Mandana Samiei
-
Research talk: Breaking the deadly triad with a target network
Speakers:- Shangtong Zhang
-
Panel: The future of reinforcement learning
Speakers:- Geoff Gordon,
- Emma Brunskill,
- Craig Boutilier
-