Tuesday, July 21, 2020

Notes on CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Very interesting state-of-the-art research in reinforcement learning (RL) from the University of California, Berkeley, with Pieter Abbeel among the authors.

However, when you look at Table 2 on page 8 of this paper, you will be astonished to find that of the 26 Atari games (How old is Atari? 1984-2013) benchmarked against some of the best state-of-the-art RL models, human performance still wins 22. At least superhuman performance is achieved in each of the 4 games where RL wins. Still, human performance beats the best RL model by a wide margin in 18 of the 26 games. This is almost a bit shocking!
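To make the counting concrete, here is a minimal sketch of how such a human-versus-agent tally could be computed from a score table. It uses the human-normalized score commonly reported for Atari benchmarks, (agent - random) / (human - random). The function names, the game names, and all the numbers are made-up placeholders for illustration, not values from the paper.

```python
def human_normalized(agent, human, random):
    """Human-normalized score: 0.0 matches random play, 1.0 matches the human."""
    return (agent - random) / (human - random)


def tally(scores):
    """Split games into those the agent wins and those the human wins."""
    agent_wins = [g for g, s in scores.items() if s["agent"] > s["human"]]
    human_wins = [g for g, s in scores.items() if s["agent"] <= s["human"]]
    return agent_wins, human_wins


# Hypothetical placeholder scores, NOT taken from Table 2 of the paper.
scores = {
    "game_a": {"random": 0.0, "human": 100.0, "agent": 150.0},
    "game_b": {"random": 10.0, "human": 110.0, "agent": 35.0},
    "game_c": {"random": 5.0, "human": 205.0, "agent": 55.0},
}

agent_wins, human_wins = tally(scores)
normalized = {g: human_normalized(**s) for g, s in scores.items()}

print("agent wins:", agent_wins)
print("human wins:", human_wins)
print("human-normalized scores:", normalized)
```

A normalized score above 1.0 means superhuman play on that game, and a score far below 1.0 is what "humans win by a wide margin" looks like in this representation.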

[2004.04136] CURL: Contrastive Unsupervised Representations for Reinforcement Learning
