HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Generalization Properties of NAS under Activation and Skip Connection Search

Zhenyu Zhu Fanghui Liu Grigorios G Chrysos Volkan Cevher

Generalization Properties of NAS under Activation and Skip Connection Search

Abstract

Neural Architecture Search (NAS) has fostered the automatic discovery of state-of-the-art neural architectures. Despite the progress achieved with NAS, so far there is little attention to theoretical guarantees on NAS. In this work, we study the generalization properties of NAS under a unifying framework enabling (deep) layer skip connection search and activation function search. To this end, we derive the lower (and upper) bounds of the minimum eigenvalue of the Neural Tangent Kernel (NTK) under the (in)finite-width regime using a certain search space including mixed activation functions, fully connected, and residual neural networks. We use the minimum eigenvalue to establish generalization error bounds of NAS in the stochastic gradient descent training. Importantly, we theoretically and experimentally show how the derived results can guide NAS to select the top-performing architectures, even in the case without training, leading to a train-free algorithm based on our theory. Accordingly, our numerical validation shed light on the design of computationally efficient methods for NAS. Our analysis is non-trivial due to the coupling of various architectures and activation functions under the unifying framework and has its own interest in providing the lower bound of the minimum eigenvalue of NTK in deep learning theory.

Benchmarks

BenchmarkMethodologyMetrics
neural-architecture-search-on-nats-benchEigenNas (Zhu et al., 2022)
Test Accuracy: 45.54
neural-architecture-search-on-nats-bench-1EigenNas (Zhu et al., 2022)
Test Accuracy: 93.46
neural-architecture-search-on-nats-bench-2EigenNas (Zhu et al., 2022)
Test Accuracy: 71.42

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Generalization Properties of NAS under Activation and Skip Connection Search | Papers | HyperAI