Unsupervised State Representation Learning in Atari
- Ankesh Anand
- Evan Racah
- Sherjil Ozair
- Yoshua Bengio
- Marc-Alexandre Côté
- Devon Hjelm
NeurIPS 2019 | ArXiv
State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporally distinct features of a neural encoder of the observations. We also introduce a new benchmark based on Atari 2600 games where we evaluate representations based on how well they capture the ground-truth state variables. We believe this new framework for evaluating representation learning models will be crucial for future representation learning research. Finally, we compare our technique with other state-of-the-art generative and contrastive representation learning methods. The code associated with this work is available on GitHub.
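To make the idea of maximizing mutual information across temporally distinct features more concrete, the following is a minimal sketch of a contrastive (InfoNCE-style) objective. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not name the estimator, so the choice of InfoNCE, the dot-product score, and the names `info_nce_loss` and `mi_lower_bound` are all hypothetical. The feature vectors stand in for encoder outputs at consecutive time steps.

```python
import math


def dot(u, v):
    """Dot product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(u, v))


def info_nce_loss(anchors, positives):
    """Mean InfoNCE loss over a batch of feature pairs.

    Assumption: anchors[i] and positives[i] are features of the same
    trajectory at consecutive time steps. Every positives[j] with
    j != i is treated as a negative sample for anchors[i].
    """
    n = len(anchors)
    total = 0.0
    for i in range(n):
        # Score the anchor against every candidate in the batch.
        scores = [dot(anchors[i], positives[j]) for j in range(n)]
        # Numerically stable log-sum-exp over the scores.
        m = max(scores)
        log_norm = m + math.log(sum(math.exp(s - m) for s in scores))
        # Cross-entropy with the matching pair as the correct class.
        total += log_norm - scores[i]
    return total / n


def mi_lower_bound(anchors, positives):
    """InfoNCE bound on mutual information: log(batch size) - loss."""
    return math.log(len(anchors)) - info_nce_loss(anchors, positives)


if __name__ == "__main__":
    # Toy features: matched pairs point in the same direction.
    feats_t = [[1.0, 0.0], [0.0, 1.0]]
    feats_t_plus_1 = [[1.0, 0.0], [0.0, 1.0]]
    print(info_nce_loss(feats_t, feats_t_plus_1))
    print(mi_lower_bound(feats_t, feats_t_plus_1))
```

Minimizing this loss pushes each anchor to score its own temporal neighbor above the other samples in the batch, which tightens a lower bound on the mutual information between the two sets of features. When the features carry no information (for example, all zeros), the loss equals the log of the batch size and the bound is zero.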