a month ago

HyperNetworks

David Ha, Andrew Dai, Quoc V. Le

Abstract

This work explores hypernetworks: an approach of using one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the relationship between a genotype (the hypernetwork) and a phenotype (the main network). Though they are also reminiscent of HyperNEAT in evolution, our hypernetworks are trained end-to-end with backpropagation and thus are usually faster. The focus of this work is to make hypernetworks useful for deep convolutional networks and long recurrent networks, where hypernetworks can be viewed as a relaxed form of weight-sharing across layers. Our main result is that hypernetworks can generate non-shared weights for LSTM and achieve near state-of-the-art results on a variety of sequence modelling tasks including character-level language modelling, handwriting generation and neural machine translation, challenging the weight-sharing paradigm for recurrent networks. Our results also show that hypernetworks applied to convolutional networks still achieve respectable results for image recognition tasks compared to state-of-the-art baseline models while requiring fewer learnable parameters.
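The core idea in the abstract, one network producing the weights of another, can be illustrated with a small forward-pass sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the `HyperNetwork` class, the choice of a single linear projection, the per-layer embedding vectors, and the layer sizes are all hypothetical. Training with backpropagation is not shown.

```python
import math
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


class HyperNetwork:
    """Hypothetical toy hypernetwork: one linear map from a layer
    embedding z to the flattened weight matrix of a main-network layer."""

    def __init__(self, embed_dim, n_in, n_out, seed=0):
        rng = random.Random(seed)
        self.n_in, self.n_out = n_in, n_out
        # Projection shared by all layers: (n_out * n_in) rows, embed_dim columns.
        self.proj = [[rng.gauss(0.0, 0.1) for _ in range(embed_dim)]
                     for _ in range(n_out * n_in)]

    def generate(self, z):
        """Return an n_out x n_in weight matrix generated from embedding z."""
        flat = matvec(self.proj, z)
        return [flat[r * self.n_in:(r + 1) * self.n_in]
                for r in range(self.n_out)]


def main_network(x, hyper, embeddings):
    """Run a stack of tanh layers whose weights all come from `hyper`."""
    h = x
    for z in embeddings:
        h = [math.tanh(a) for a in matvec(hyper.generate(z), h)]
    return h


hyper = HyperNetwork(embed_dim=2, n_in=4, n_out=4)
embeddings = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # one small vector per layer
output = main_network([1.0, -1.0, 0.5, 0.0], hyper, embeddings)

# Learnable parameters: shared projection plus embeddings, versus
# storing a separate 4x4 weight matrix for each of the three layers.
shared_params = len(hyper.proj) * 2 + sum(len(z) for z in embeddings)
direct_params = len(embeddings) * 4 * 4
```

In this toy setup each layer receives different weights, yet all of them are derived from the same projection, which is one way to read "relaxed weight-sharing". The parameter counts compare the hypernetwork's projection and embeddings against independent per-layer matrices; the gap in a real model depends on the architecture and is not implied by these numbers.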

Code Repositories

tjuhaoxiaotian/pymarl3 (pytorch)
shyamsn97/hyper-nn (jax)
g1910/HyperNetworks (pytorch)
chrhenning/hypnettorch (pytorch)
gtegner/hyper-gan (pytorch)
cellistigs/ensemble_attention (pytorch)
pennfranc/hypnettorch (pytorch)

Benchmarks

Benchmark | Methodology | Metrics
language-modelling-on-enwiki8 | Hypernetworks | Bit per Character (BPC): 1.34; Number of params: 27M
language-modelling-on-penn-treebank-character | 2-layer Norm HyperLSTM | Bit per Character (BPC): 1.219; Number of params: 14.4M
