HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

SemiReward: A General Reward Model for Semi-supervised Learning

Li Siyuan ; Jin Weiyang ; Wang Zedong ; Wu Fang ; Liu Zicheng ; Tan Cheng ; Li Stan Z.

SemiReward: A General Reward Model for Semi-supervised Learning

Abstract

Semi-supervised learning (SSL) has witnessed great progress with variousimprovements in the self-training framework with pseudo labeling. The mainchallenge is how to distinguish high-quality pseudo labels against theconfirmation bias. However, existing pseudo-label selection strategies arelimited to pre-defined schemes or complex hand-crafted policies speciallydesigned for classification, failing to achieve high-quality labels, fastconvergence, and task versatility simultaneously. To these ends, we propose aSemi-supervised Reward framework (SemiReward) that predicts reward scores toevaluate and filter out high-quality pseudo labels, which is pluggable tomainstream SSL methods in wide task types and scenarios. To mitigateconfirmation bias, SemiReward is trained online in two stages with a generatormodel and subsampling strategy. With classification and regression tasks on 13standard SSL benchmarks across three modalities, extensive experiments verifythat SemiReward achieves significant performance gains and faster convergencespeeds upon Pseudo Label, FlexMatch, and Free/SoftMatch. Code and models areavailable at https://github.com/Westlake-AI/SemiReward.

Code Repositories

Westlake-AI/SemiReward
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
semi-supervised-image-classification-on-1SemiReward
Top 1 Accuracy: 59.64%
semi-supervised-image-classification-on-cifar-8SemiReward
Percentage error: 15.62

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
SemiReward: A General Reward Model for Semi-supervised Learning | Papers | HyperAI