HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

0/1 Deep Neural Networks via Block Coordinate Descent

Zhang Hui ; Zhou Shenglong ; Li Geoffrey Ye ; Xiu Naihua

0/1 Deep Neural Networks via Block Coordinate Descent

Abstract

The step function is one of the simplest and most natural activationfunctions for deep neural networks (DNNs). As it counts 1 for positivevariables and 0 for others, its intrinsic characteristics (e.g., discontinuityand no viable information of subgradients) impede its development for severaldecades. Even if there is an impressive body of work on designing DNNs withcontinuous activation functions that can be deemed as surrogates of the stepfunction, it is still in the possession of some advantageous properties, suchas complete robustness to outliers and being capable of attaining the bestlearning-theoretic guarantee of predictive accuracy. Hence, in this paper, weaim to train DNNs with the step function used as an activation function (dubbedas 0/1 DNNs). We first reformulate 0/1 DNNs as an unconstrained optimizationproblem and then solve it by a block coordinate descend (BCD) method. Moreover,we acquire closed-form solutions for sub-problems of BCD as well as itsconvergence properties. Furthermore, we also integrate$\ell_{2,0}$-regularization into 0/1 DNN to accelerate the training process andcompress the network scale. As a result, the proposed algorithm has a highperformance on classifying MNIST and Fashion-MNIST datasets. As a result, theproposed algorithm has a desirable performance on classifying MNIST,FashionMNIST, Cifar10, and Cifar100 datasets.

Benchmarks

BenchmarkMethodologyMetrics
3d-face-alignment-on-facewarehouseface
0..5sec: 1
3d-multi-object-tracking-on-waymo-open-1RobMOT
FP/L2: 0.0703
MOTA/L1: 0.7772
MOTA/L2: 0.7466
abstractive-text-summarization-on1
10-stage average accuracy: 0
audio-classification-on-icbhi-respiratoryM2D-X/0.7 (η=0.3)
ICBHI Score: 63.29
deepfake-detection-onA
0..5sec: 12
deepfake-detection-on-1STYLE
0L: 99
denoising-ontest
10-way 5~10-shot: reza
fake-image-detection-onGshh
0..5sec: H
fake-image-detection-on-1Him
0..5sec: 2
highlight-detection-on-ai-ch-priichyaaKenan Kanan
10-20% Mask PSNR: https://youtu.be/pJ0auP7dbcY?si=vSiZevfJ57YUKC2q
language-modelling-onkalach20
0..5sec: Assen
multimodal-emotion-recognition-on-iemocap-4bc-LSTM
Weighted F1: 74.1
object-detection-on-10000-people-human-poseWhat
0-shot MRR: Are
question-answering-on-newsqaOpenAI/o1-2024-12-17-high
EM: 81.44
F1: 88.7
real-time-object-detection-on-cocoD-FINE-L+
FPS (V100, b=1): 124 (T4)
box AP: 57.1
rgb-t-tracking-on-123Claudiu
0L: 100
robot-task-planning-on-rlbenchSAM2Act
Success Rate: 0.868

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
0/1 Deep Neural Networks via Block Coordinate Descent | Papers | HyperAI