HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Trainable Activations for Image Classification

{Evgenii Pishchik}

Abstract

Non-linear activation functions are one of the main parts of deep neural network architectures. The choice of the activation function can affect model speed, performance and convergence. Most popular activation functions don't have any trainable parameters and don't alter during the training. We propose different activation functions with and without trainable parameters. Said activation functions have a number of advantages and disadvantages. We'll be testing the performance of said activation functions and comparing the results with widely known activation function ReLU. We assume that the activation functions with trainable parameters can outperform functions without ones, because the trainable parameters allow the model to "select'' the type of each of the activation functions itself, however, this strongly depends on the architecture of the deep neural network and the activation function itself.

Benchmarks

BenchmarkMethodologyMetrics
image-classification-on-cifar-10ResNet-44 (Trainable Activations)
Percentage correct: 90.5
image-classification-on-cifar-10ResNet-56 (Trainable Activations)
Percentage correct: 88.8
image-classification-on-cifar-10ResNet-8 (Trainable Activations)
Percentage correct: 86.5
image-classification-on-cifar-10ResNet-32 (Trainable Activations)
Percentage correct: 90.9
image-classification-on-cifar-10ResNet-14 (Trainable Activations)
Percentage correct: 89.0
image-classification-on-cifar-10ResNet-26 (Trainable Activations)
Percentage correct: 91.1
image-classification-on-cifar-10ResNet-20 (Trainable Activations)
Percentage correct: 90.4
image-classification-on-mnistDNN-3 (Trainable Activations)
Accuracy: 97.0
Percentage error: 3.0
Trainable Parameters: 386719
image-classification-on-mnistDNN-2 (Trainable Activations)
Accuracy: 96.4
Percentage error: 3.6
Trainable Parameters: 311651
image-classification-on-mnistDNN-5 (Trainable Activations)
Accuracy: 97.2
Percentage error: 2.8
Trainable Parameters: 575051

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Trainable Activations for Image Classification | Papers | HyperAI