HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Backpropagation through Combinatorial Algorithms: Identity with Projection Works

Subham Sekhar Sahoo Anselm Paulus Marin Vlastelica Vít Musil Volodymyr Kuleshov Georg Martius

Backpropagation through Combinatorial Algorithms: Identity with Projection Works

Abstract

Embedding discrete solvers as differentiable layers has given modern deep learning architectures combinatorial expressivity and discrete reasoning capabilities. The derivative of these solvers is zero or undefined, therefore a meaningful replacement is crucial for effective gradient-based learning. Prior works rely on smoothing the solver with input perturbations, relaxing the solver to continuous problems, or interpolating the loss landscape with techniques that typically require additional solver calls, introduce extra hyper-parameters, or compromise performance. We propose a principled approach to exploit the geometry of the discrete solution space to treat the solver as a negative identity on the backward pass and further provide a theoretical justification. Our experiments demonstrate that such a straightforward hyper-parameter-free approach is able to compete with previous more complex methods on numerous experiments such as backpropagation through discrete samplers, deep graph matching, and image retrieval. Furthermore, we substitute the previously proposed problem-specific and label-dependent margin with a generic regularization procedure that prevents cost collapse and increases robustness.

Code Repositories

martius-lab/solver-differentiation-identity
Official
tf
Mentioned in GitHub
khalil-research/pyepo
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
density-estimation-on-mnistIdentity
NLL (bits/dim): 0.134

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Backpropagation through Combinatorial Algorithms: Identity with Projection Works | Papers | HyperAI