HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Efficient Neural Architecture Search via Parameter Sharing

Hieu Pham; Melody Y. Guan; Barret Zoph; Quoc V. Le; Jeff Dean

Efficient Neural Architecture Search via Parameter Sharing

Abstract

We propose Efficient Neural Architecture Search (ENAS), a fast and inexpensive approach for automatic model design. In ENAS, a controller learns to discover neural network architectures by searching for an optimal subgraph within a large computational graph. The controller is trained with policy gradient to select a subgraph that maximizes the expected reward on the validation set. Meanwhile the model corresponding to the selected subgraph is trained to minimize a canonical cross entropy loss. Thanks to parameter sharing between child models, ENAS is fast: it delivers strong empirical performances using much fewer GPU-hours than all existing automatic model design approaches, and notably, 1000x less expensive than standard Neural Architecture Search. On the Penn Treebank dataset, ENAS discovers a novel architecture that achieves a test perplexity of 55.8, establishing a new state-of-the-art among all methods without post-training processing. On the CIFAR-10 dataset, ENAS designs novel architectures that achieve a test error of 2.89%, which is on par with NASNet (Zoph et al., 2018), whose test error is 2.65%.

Code Repositories

Ezereal/enas
tf
Mentioned in GitHub
MengTianjian/enas-pytorch
pytorch
Mentioned in GitHub
cshannonn/blackscholes_nas
Mentioned in GitHub
aymanshams07/enas_cifar10
tf
Mentioned in GitHub
zbyte64/pytorch-dagsearch
pytorch
Mentioned in GitHub
guoyongcs/NATv2
pytorch
Mentioned in GitHub
countif/enas_nni
tf
Mentioned in GitHub
MINGUKKANG/PNU_Termproject_ENAS
tf
Mentioned in GitHub
ahundt/renas
tf
Mentioned in GitHub
ahundt/enas
tf
Mentioned in GitHub
guoyongcs/NAT
pytorch
Mentioned in GitHub
invisibleForce/ENAS-Pytorch
pytorch
Mentioned in GitHub
rutgerswiselab/autolossgen
pytorch
Mentioned in GitHub
carpedm20/ENAS-pytorch
pytorch
Mentioned in GitHub
HaiTMai/Time-Series-Forecast
Mentioned in GitHub
melodyguan/enas
tf
Mentioned in GitHub
MINGUKKANG/PNU_Capstone_Design
tf
Mentioned in GitHub
RualPerez/AutoML
pytorch
Mentioned in GitHub
WillButAgain/ENAS
pytorch
Mentioned in GitHub
MINGUKKANG/ENAS-Tensorflow
tf
Mentioned in GitHub
distrue/enas_tensorflow
tf
Mentioned in GitHub
f51980280/ENAS-Implement
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
architecture-search-on-cifar-10-imageENAS + c/o
Params: 4.6M
Percentage error: 2.89
language-modelling-on-penn-treebank-wordEfficient NAS
Params: 24M
Test perplexity: 58.6
Validation perplexity: 60.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Efficient Neural Architecture Search via Parameter Sharing | Papers | HyperAI