HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Temporal Ensembling for Semi-Supervised Learning

Samuli Laine; Timo Aila

Temporal Ensembling for Semi-Supervised Learning

Abstract

In this paper, we present a simple and efficient method for training deep neural networks in a semi-supervised setting where only a small portion of training data is labeled. We introduce self-ensembling, where we form a consensus prediction of the unknown labels using the outputs of the network-in-training on different epochs, and most importantly, under different regularization and input augmentation conditions. This ensemble prediction can be expected to be a better predictor for the unknown labels than the output of the network at the most recent training epoch, and can thus be used as a target for training. Using our method, we set new records for two standard semi-supervised learning benchmarks, reducing the (non-augmented) classification error rate from 18.44% to 7.05% in SVHN with 500 labels and from 18.63% to 16.55% in CIFAR-10 with 4000 labels, and further to 5.12% and 12.16% by enabling the standard augmentations. We additionally obtain a clear improvement in CIFAR-100 classification accuracy by using random images from the Tiny Images dataset as unlabeled extra inputs during training. Finally, we demonstrate good tolerance to incorrect labels.

Benchmarks

BenchmarkMethodologyMetrics
semi-supervised-image-classification-on-cifarPi Model
Percentage error: 12.16
semi-supervised-image-classification-on-cifar-2Temporal ensembling
Percentage error: 38.65
semi-supervised-image-classification-on-cifar-6Ⅱ-Model
Percentage error: 53.12

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Temporal Ensembling for Semi-Supervised Learning | Papers | HyperAI