HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

RAFT-3D: Scene Flow using Rigid-Motion Embeddings

Zachary Teed Jia Deng

RAFT-3D: Scene Flow using Rigid-Motion Embeddings

Abstract

We address the problem of scene flow: given a pair of stereo or RGB-D video frames, estimate pixelwise 3D motion. We introduce RAFT-3D, a new deep architecture for scene flow. RAFT-3D is based on the RAFT model developed for optical flow but iteratively updates a dense field of pixelwise SE3 motion instead of 2D motion. A key innovation of RAFT-3D is rigid-motion embeddings, which represent a soft grouping of pixels into rigid objects. Integral to rigid-motion embeddings is Dense-SE3, a differentiable layer that enforces geometric consistency of the embeddings. Experiments show that RAFT-3D achieves state-of-the-art performance. On FlyingThings3D, under the two-view evaluation, we improved the best published accuracy (d < 0.05) from 34.3% to 83.7%. On KITTI, we achieve an error of 5.77, outperforming the best published method (6.31), despite using no object instance supervision. Code is available at https://github.com/princeton-vl/RAFT-3D.

Code Repositories

princeton-vl/RAFT-3D
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
scene-flow-estimation-on-springRAFT-3D (F)
1px total: 78.822
scene-flow-estimation-on-springRAFT-3D (K)
1px total: 37.262

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
RAFT-3D: Scene Flow using Rigid-Motion Embeddings | Papers | HyperAI