Command Palette
Search for a command to run...
Clément Godard; Oisin Mac Aodha; Michael Firman; Gabriel Brostow

Abstract
Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods. Research on self-supervised monocular training usually explores increasingly complex architectures, loss functions, and image formation models, all of which have recently helped to close the gap with fully-supervised methods. We show that a surprisingly simple model, and associated design choices, lead to superior predictions. In particular, we propose (i) a minimum reprojection loss, designed to robustly handle occlusions, (ii) a full-resolution multi-scale sampling method that reduces visual artifacts, and (iii) an auto-masking loss to ignore training pixels that violate camera motion assumptions. We demonstrate the effectiveness of each component in isolation, and show high quality, state-of-the-art results on the KITTI benchmark.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| camera-pose-estimation-on-kitti-odometry | Monodepth2 | Absolute Trajectory Error [m]: 93.04 Average Rotational Error er[%]: 20.72 Average Translational Error et[%]: 43.21 |
| monocular-depth-estimation-on-kitti-eigen | monodepth2 M | absolute relative error: 0.106 |
| monocular-depth-estimation-on-make3d | Monodepth2 | Abs Rel: 0.322 RMSE: 7.417 Sq Rel: 3.589 |
| monocular-depth-estimation-on-mid-air-dataset | Monodepth2 | Abs Rel: 0.717 RMSE: 74.552 RMSE log: 0.882 SQ Rel: 37.164 |
| monocular-depth-estimation-on-va | MonoDepth2 | Absolute relative error (AbsRel): 0.203 Log root mean square error (RMSE_log): 0.251 Mean average error (MAE) : 0.295 Root mean square error (RMSE): 0.432 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.