HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Aravind Srinivas Michael Laskin Pieter Abbeel

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Abstract

We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.2x performance gains at the 100K environment and interaction steps benchmarks respectively. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency of methods that use state-based features. Our code is open-sourced and available at https://github.com/MishaLaskin/curl.

Code Repositories

MishaLaskin/curl
Official
pytorch
KarlXing/RL-Visual-Continuous-Control
pytorch
Mentioned in GitHub
gijskoning/ReproducingCURL
pytorch
Mentioned in GitHub
aravindsrinivas/curl_rainbow
pytorch
Mentioned in GitHub
asparius/barlowrl
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
atari-games-on-atari-2600-alienCURL
Score: 1148.2
atari-games-on-atari-2600-amidarCURL
Score: 232.3
atari-games-on-atari-2600-assaultCURL
Score: 543.7
atari-games-on-atari-2600-asterixCURL
Score: 524.3
atari-games-on-atari-2600-bank-heistCURL
Score: 193.7
atari-games-on-atari-2600-battle-zoneCURL
Score: 11208
atari-games-on-atari-2600-boxingCURL
Score: 4.8
atari-games-on-atari-2600-breakoutCURL
Score: 18.2
atari-games-on-atari-2600-chopper-commandCURL
Score: 1198
atari-games-on-atari-2600-crazy-climberCURL
Score: 27805.6
atari-games-on-atari-2600-demon-attackCURL
Score: 834
atari-games-on-atari-2600-freewayCURL
Score: 27.9
atari-games-on-atari-2600-frostbiteCURL
Score: 924
atari-games-on-atari-2600-gopherCURL
Score: 801.4
atari-games-on-atari-2600-heroCURL
Score: 6235.1
atari-games-on-atari-2600-james-bondCURL
Medium Human-Normalized Score: 400.1
atari-games-on-atari-2600-kangarooCURL
Score: 345.3
atari-games-on-atari-2600-krullCURL
Score: 3833.6
atari-games-on-atari-2600-kung-fu-masterCURL
Score: 14280
atari-games-on-atari-2600-ms-pacmanCURL
Score: 1492.8
atari-games-on-atari-2600-pongCURL
Score: 2.1
atari-games-on-atari-2600-private-eyeCURL
Score: 105.2
atari-games-on-atari-2600-qbertCURL
Score: 1225.6
atari-games-on-atari-2600-road-runnerCURL
Score: 6786.7
atari-games-on-atari-2600-seaquestCURL
Score: 408
atari-games-on-atari-2600-up-and-downCURL
Score: 2735.2
continuous-control-on-ball-in-cup-catchCURL
Score: 959
continuous-control-on-ball-in-cup-catch-1CURL
Score: 769
continuous-control-on-cartpole-swingupCURL
Score: 841
continuous-control-on-cartpole-swingup-1CURL
Score: 582
continuous-control-on-cheetah-runCURL
Score: 518
continuous-control-on-cheetah-run-1CURL
Score: 299
continuous-control-on-finger-spinCURL
Score: 926
continuous-control-on-finger-spin-1CURL
Score: 767
continuous-control-on-reacher-easyCURL
Score: 929
continuous-control-on-reacher-easy-1CURL
Score: 538
continuous-control-on-walker-walkCURL
Score: 902
continuous-control-on-walker-walk-1CURL
Score: 403

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
CURL: Contrastive Unsupervised Representations for Reinforcement Learning | Papers | HyperAI