Deep Ordinal Regression Network for Monocular Depth Estimation

Huan Fu; Mingming Gong; Chaohui Wang; Kayhan Batmanghelich; Dacheng Tao

Abstract

Monocular depth estimation, which plays a crucial role in understanding 3D scene geometry, is an ill-posed problem. Recent methods have improved significantly by exploiting image-level information and hierarchical features from deep convolutional neural networks (DCNNs). These methods model depth estimation as a regression problem and train the regression networks by minimizing mean squared error, which suffers from slow convergence and unsatisfactory local solutions. In addition, existing depth estimation networks employ repeated spatial pooling operations, resulting in undesirable low-resolution feature maps. Obtaining high-resolution depth maps then requires skip connections or multi-layer deconvolution networks, which complicate network training and consume far more computation. To eliminate or at least largely reduce these problems, we introduce a spacing-increasing discretization (SID) strategy to discretize depth and recast depth network learning as an ordinal regression problem. By training the network with an ordinal regression loss, our method achieves much higher accuracy and faster convergence at the same time. Furthermore, we adopt a multi-scale network structure that avoids unnecessary spatial pooling and captures multi-scale information in parallel. The method described in this paper achieves state-of-the-art results on four challenging benchmarks, i.e., KITTI [17], ScanNet [9], Make3D [50], and NYU Depth v2 [42], and won 1st prize in the Robust Vision Challenge 2018. Code has been made available at: https://github.com/hufu6371/DORN.
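The discretization and ordinal formulation mentioned in the abstract can be illustrated with a minimal sketch. It assumes bin edges spaced uniformly in log-depth between a minimum depth alpha and a maximum depth beta, which makes the bins wider as depth grows, and it uses a simplified ordinal loss with K - 1 binary "is the label greater than k" decisions per pixel. The function names, the example values (alpha = 1, beta = 80, K = 4), and the midpoint decoding are illustrative assumptions, not the released DORN code.

```python
import math


def sid_thresholds(alpha, beta, num_bins):
    """Return num_bins + 1 bin edges t_0..t_K, uniformly spaced in log-depth.

    Assumed form: t_i = exp(log(alpha) + log(beta / alpha) * i / K).
    """
    if not (0 < alpha < beta):
        raise ValueError("need 0 < alpha < beta")
    log_alpha = math.log(alpha)
    log_ratio = math.log(beta / alpha)
    return [math.exp(log_alpha + log_ratio * i / num_bins)
            for i in range(num_bins + 1)]


def depth_to_ordinal_label(depth, edges):
    """Map a metric depth to a bin index in 0..K-1 by counting the
    interior edges that lie at or below it."""
    return sum(1 for t in edges[1:-1] if depth >= t)


def ordinal_targets(label, num_bins):
    """Binary targets for the K - 1 questions 'is the label greater than k?'."""
    return [1 if label > k else 0 for k in range(num_bins - 1)]


def ordinal_loss(probs, label):
    """Sum of binary cross-entropies over the K - 1 ordinal decisions.

    probs[k] is the predicted probability that the true label exceeds k.
    """
    eps = 1e-12  # keeps log() finite for probabilities of exactly 0 or 1
    loss = 0.0
    for k, p in enumerate(probs):
        p = min(max(p, eps), 1.0 - eps)
        loss -= math.log(p) if label > k else math.log(1.0 - p)
    return loss


def decode_depth(probs, edges):
    """Count decisions with probability >= 0.5 and return the midpoint
    of the corresponding bin (an assumed decoding rule)."""
    label = sum(1 for p in probs if p >= 0.5)
    return 0.5 * (edges[label] + edges[label + 1])


edges = sid_thresholds(1.0, 80.0, 4)
label = depth_to_ordinal_label(5.0, edges)
targets = ordinal_targets(label, 4)
```

Unlike a plain multi-class loss, this loss penalizes a prediction more the further it lands from the true bin, because each additional wrong decision adds its own cross-entropy term.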

Benchmarks

Benchmark | Methodology | Metrics
depth-estimation-on-nyu-depth-v2 | DORN | RMS: 0.509
monocular-depth-estimation-on-kitti-eigen | DORN | Delta < 1.25: 0.932; Delta < 1.25^2: 0.984; Delta < 1.25^3: 0.994; RMSE: 2.727; RMSE log: 0.120; absolute relative error: 0.072
monocular-depth-estimation-on-nyu-depth-v2 | DORN | RMSE: 0.509
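
For readers unfamiliar with the metric names in the benchmark listing, the following sketch computes them under their commonly used definitions. It assumes flat lists of strictly positive predicted and ground-truth depths and a natural logarithm for RMSE log; the function name `depth_metrics` and the dictionary keys are hypothetical, and individual benchmarks may apply additional cropping or depth caps that are not modeled here.

```python
import math


def depth_metrics(pred, gt):
    """Compute threshold accuracies and error metrics for paired depths.

    pred, gt: equal-length sequences of strictly positive depth values.
    """
    if len(pred) != len(gt) or not gt:
        raise ValueError("pred and gt must be non-empty and the same length")
    n = len(gt)
    # Symmetric ratio: 1.0 for a perfect prediction, larger otherwise.
    ratios = [max(p / g, g / p) for p, g in zip(pred, gt)]
    return {
        # Fraction of pixels whose ratio is below 1.25, 1.25^2, 1.25^3.
        "delta1": sum(r < 1.25 for r in ratios) / n,
        "delta2": sum(r < 1.25 ** 2 for r in ratios) / n,
        "delta3": sum(r < 1.25 ** 3 for r in ratios) / n,
        # Root mean squared error in depth units.
        "rmse": math.sqrt(sum((p - g) ** 2 for p, g in zip(pred, gt)) / n),
        # Root mean squared error of log-depths (natural log assumed).
        "rmse_log": math.sqrt(
            sum((math.log(p) - math.log(g)) ** 2 for p, g in zip(pred, gt)) / n
        ),
        # Mean absolute error relative to the ground-truth depth.
        "abs_rel": sum(abs(p - g) / g for p, g in zip(pred, gt)) / n,
    }


metrics = depth_metrics([2.0, 4.0], [2.0, 5.0])
```

Higher is better for the Delta accuracies, while lower is better for RMSE, RMSE log, and absolute relative error.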
