Pinwheel graphic representing the Microsoft Research Summit
Return to Event: Microsoft Research Summit 2021

Microsoft Research Summit 2021 • Videos

Research talk: Project Dexter: Machine learning and automatic decision-making for robotic manipulation

Robot technology has long held the promise of disrupting many important industries that involve dexterous object manipulation in weakly structured environments, including healthcare, agriculture, and infrastructure maintenance. The increasing versatility of robotic manipulation hardware seemingly puts this disruption within reach. However, hardware versatility comes at a cost. Its complexity defies traditional control-based approaches, and recent success stories, such as a dexterous robotic hand assembling a Rubik’s cube, highlight the reality: it can take world-class roboticists dozens of person-years to train an advanced robotic manipulator to solve a single problem with modern machine learning–based sequential decision-making techniques. The goal of Microsoft Research’s Project Dexter is to enable training robotic manipulation policies for real-world tasks at a practical cost in terms of expertise, time, and compute. This talk will give an overview of Project Dexter’s research agenda, which focuses on learning generalizable representations of visual, tactile, and proprioceptive observation data on Transformer-based architectures, and on both reinforcement learning and imitation learning approaches capable of using this generalized knowledge with other prior information to achieve practically acceptable sample efficiency. We will outline several directions we are currently exploring in this large problem space and briefly delve into one of them—an offline reinforcement learning approach the team is researching in the robotic manipulation context.

Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit (opens in new tab)

Track:
Reinforcement Learning
Date:
Speakers:
Andrey Kolobov, Ching-An Cheng
Affiliation:
Microsoft Research Redmond

Reinforcement Learning