Policy Gradient Methods: Tutorial and New Frontiers

In this tutorial we discuss several recent advances in deep reinforcement learning involving policy gradient methods. These methods have shown significant success in a wide range of domains, including continuous-action domains such as manipulation, locomotion, and flight. They have also achieved the state of the art in discrete action domains such as Atari. We will provide a unifying overview of a variety of different policy gradient methods, and we will also discuss the formalism of stochastic computation graphs for computing gradients of expectations.

专题：: Cambridge Lab PhD Summer School
日期：: 2017年7月3日
演讲者：: John Schulman
所属机构：: UC Berkeley

- Scarlet Schwiderski-Grosche
  
  Director
研究领域
研究院
- Microsoft Research Lab - Cambridge
活动
- AI Summer School 2017

系列： Cambridge Lab PhD Summer School

The Malmo Collaborative AI Challenge
July 6, 2017
Speakers:

Scarlet Schwiderski-Grosche
Counterfactual Multi-Agent Policy Gradients
July 6, 2017
Speakers:

Scarlet Schwiderski-Grosche
Design - On the Human Side
July 5, 2017
Speakers:

Alex Taylor,

Scarlet Schwiderski-Grosche
Probabilistic Machine Learning and AI
July 5, 2017
Speakers:

Scarlet Schwiderski-Grosche
Policy Gradient Methods: Tutorial and New Frontiers
July 3, 2017
Speakers:

Scarlet Schwiderski-Grosche
Strategic Thinking for Researchers
August 1, 2016
Speakers:

Andy Gordon,

Jeff Running
How to Write a Great Research Paper
July 8, 2016
Speakers:

Scarlet Schwiderski-Grosche,

Simon Peyton Jones
Project Malmo – a platform for fundamental AI research
July 7, 2016
Speakers:

Scarlet Schwiderski-Grosche
No Compromises: Distributed Transactions with Consistency, Availability, and Performance
July 5, 2016
Speakers:

Scarlet Schwiderski-Grosche
The Evolution of Innovation
July 5, 2016
Speakers:

Scarlet Schwiderski-Grosche
How to Give a Great Research Talk
July 5, 2016
Speakers:

Scarlet Schwiderski-Grosche,

Simon Peyton Jones