Command Palette
Search for a command to run...
Eunbyung Park; Junier B. Oliva

Abstract
We propose meta-curvature (MC), a framework to learn curvature information for better generalization and fast model adaptation. MC expands on the model-agnostic meta-learner (MAML) by learning to transform the gradients in the inner optimization such that the transformed gradients achieve better generalization performance to a new task. For training large scale neural networks, we decompose the curvature matrix into smaller matrices in a novel scheme where we capture the dependencies of the model's parameters with a series of tensor products. We demonstrate the effects of our proposed method on several few-shot learning tasks and datasets. Without any task specific techniques and architectures, the proposed method achieves substantial improvement upon previous MAML variants and outperforms the recent state-of-the-art methods. Furthermore, we observe faster convergence rates of the meta-training process. Finally, we present an analysis that explains better generalization performance with the meta-trained curvature.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| few-shot-image-classification-on-mini-2 | MC2+ | Accuracy: 55.73 |
| few-shot-image-classification-on-mini-3 | MC2+ | Accuracy: 70.33 |
| few-shot-image-classification-on-omniglot-1-1 | MC2+ | Accuracy: 88% |
| few-shot-image-classification-on-omniglot-1-2 | MC2+ | Accuracy: 99.97 |
| few-shot-image-classification-on-omniglot-5-1 | MC2+ | Accuracy: 99.65% |
| few-shot-image-classification-on-omniglot-5-2 | MC2+ | Accuracy: 99.89 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.