MSR theme: Reinforcement Learning Research

Reinforcement Learning | Montréal

Projects

Offline Reinforcement Learning

This page introduces the research area of Offline Reinforcement Learning (sometimes also called Batch Reinforcement Learning), which consists of training a target policy from a fixed dataset of trajectories collected with a behavioral policy. In comparison to classic Reinforcement Learning…

Hybrid Reward Architecture

For reinforcement learning (RL), where the goal is to learn good behavior in a data-driven way, the Arcade Learning Environment (ALE), which provides access to a large number of Atari 2600 games, has been a popular test-bed.