However, when you look at table 2 on page 8 (or see below) of this paper you will be astonished to find out that of the 26 Atari games (How old is Atari? 1984-2013) that were benchmarked against some of the best state of the art RL models, human performance still wins 22 games. At least, super human performance is achieved in every of those 4 games where RL wins. However, human performance still beats the best RL by a wide margin in 18 of the 26 games. This is almost a bit shocking!
[2004.04136] CURL: Contrastive Unsupervised Representations for Reinforcement Learning
No comments:
Post a Comment