HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Iterative Geometry Encoding Volume for Stereo Matching

Gangwei Xu; Xianqi Wang; Xiaohuan Ding; Xin Yang

Iterative Geometry Encoding Volume for Stereo Matching

Abstract

Recurrent All-Pairs Field Transforms (RAFT) has shown great potentials in matching tasks. However, all-pairs correlations lack non-local geometry knowledge and have difficulties tackling local ambiguities in ill-posed regions. In this paper, we propose Iterative Geometry Encoding Volume (IGEV-Stereo), a new deep network architecture for stereo matching. The proposed IGEV-Stereo builds a combined geometry encoding volume that encodes geometry and context information as well as local matching details, and iteratively indexes it to update the disparity map. To speed up the convergence, we exploit GEV to regress an accurate starting point for ConvGRUs iterations. Our IGEV-Stereo ranks $1^{st}$ on KITTI 2015 and 2012 (Reflective) among all published methods and is the fastest among the top 10 methods. In addition, IGEV-Stereo has strong cross-dataset generalization as well as high inference efficiency. We also extend our IGEV to multi-view stereo (MVS), i.e. IGEV-MVS, which achieves competitive accuracy on DTU benchmark. Code is available at https://github.com/gangweiX/IGEV.

Code Repositories

gangweix/igev
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
omnnidirectional-stereo-depth-estimation-onIGEV-Stereo
Depth-LRCE: 1.203
Depth-MAE: 1.860
Depth-MARE: 0.146
Depth-RMSE: 4.474
Disp-MAE: 0.225
Disp-MARE: 0.172
Disp-RMSE: 0.423

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp