HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

Weihao Yuan Xiaodong Gu Zuozhuo Dai Siyu Zhu Ping Tan

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

Abstract

Estimating the accurate depth from a single image is challenging since it is inherently ambiguous and ill-posed. While recent works design increasingly complicated and powerful networks to directly regress the depth map, we take the path of CRFs optimization. Due to the expensive computation, CRFs are usually performed between neighborhoods rather than the whole graph. To leverage the potential of fully-connected CRFs, we split the input into windows and perform the FC-CRFs optimization within each window, which reduces the computation complexity and makes FC-CRFs feasible. To better capture the relationships between nodes in the graph, we exploit the multi-head attention mechanism to compute a multi-head potential function, which is fed to the networks to output an optimized depth map. Then we build a bottom-up-top-down structure, where this neural window FC-CRFs module serves as the decoder, and a vision transformer serves as the encoder. The experiments demonstrate that our method significantly improves the performance across all metrics on both the KITTI and NYUv2 datasets, compared to previous methods. Furthermore, the proposed method can be directly applied to panorama images and outperforms all previous panorama methods on the MatterPort3D dataset. Project page: https://weihaosky.github.io/newcrfs.

Code Repositories

aliyun/NeWCRFs
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
monocular-depth-estimation-on-kitti-eigenNeWCRFs
Delta u003c 1.25: 0.974
Delta u003c 1.25^2: 0.997
Delta u003c 1.25^3: 0.999
RMSE: 2.129
RMSE log: 0.079
Sq Rel: 0.155
absolute relative error: 0.052
monocular-depth-estimation-on-matterport3dNeWCRFs
Delta u003c 1.25: 0.9376
Delta u003c 1.25^2: 0.9812
Delta u003c 1.25^3: 0.9933
RMSE: 0.4279
absolute error: 0.197
absolute relative error: 0.0793
monocular-depth-estimation-on-nyu-depth-v2NeWCRFs
Delta u003c 1.25: 0.922
Delta u003c 1.25^2: 0.992
Delta u003c 1.25^3: 0.998
RMSE: 0.334
absolute relative error: 0.095
log 10: 0.041

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation | Papers | HyperAI