arXiv:2004.04136 Abstract | arXiv Analytics

arXiv:2004.04136 [cs.LG]Abstract References Reviews Resources

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Aravind Srinivas, Michael Laskin, Pieter Abbeel

Published 2020-04-08Version 1

We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 2.8x and 1.6x performance gains respectively at the 100K interaction steps benchmark. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency and performance of methods that use state-based features.

Comments: First two authors contributed equally, website: https://mishalaskin.github.io/curl code: https://github.com/MishaLaskin/curl

Categories: cs.LG, cs.CV, stat.ML

Keywords: contrastive unsupervised representations, reinforcement learning, deepmind control suite, curl extracts high-level features, 100k interaction steps benchmark

Tags: github project

Related articles: Most relevant | Search more

arXiv:1706.04711 [cs.LG] (Published 2017-06-15)

Reinforcement Learning under Model Mismatch

Aurko Roy, Huan Xu, Sebastian Pokutta

arXiv:1306.6189 [cs.LG] (Published 2013-06-26)

Scaling Up Robust MDPs by Reinforcement Learning

Aviv Tamar, Huan Xu, Shie Mannor

arXiv:1301.0601 [cs.LG] (Published 2012-12-12)