Command Palette
Search for a command to run...
Rafael Pinto

Abstract
Episodic control provides a highly sample-efficient method for reinforcement learning while enforcing high memory and computational requirements. This work proposes a simple heuristic for reducing these requirements, and an application to Model-Free Episodic Control (MFEC) is presented. Experiments on Atari games show that this heuristic successfully reduces MFEC computational demands while producing no significant loss of performance when conservative choices of hyperparameters are used. Consequently, episodic control becomes a more feasible option when dealing with reinforcement learning tasks.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| atari-games-on-atari-2600-frostbite | MFEC | Best Score: 4020 Score: 2394 |
| atari-games-on-atari-2600-hero | MFEC | Best Score: 13190 Score: 11732 |
| atari-games-on-atari-2600-ms-pacman | MFEC | Best Score: 11301 Score: 8530.4004 |
| atari-games-on-atari-2600-qbert | MFEC | Best Score: 19750 Score: 14135 |
| atari-games-on-atari-2600-river-raid | MFEC | Best Score: 5080 Score: 3868 |
| atari-games-on-atari-2600-space-invaders | MFEC | Best Score: 2490 Score: 1990 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.