GeoMVSNet: Learning Multi-View Stereo With Geometry Perception

Ronggang Wang, Yuxi Hu, Rui Peng, Zhe Zhang

Abstract

Recent cascade Multi-View Stereo (MVS) methods can efficiently estimate high-resolution depth maps by narrowing hypothesis ranges. However, previous methods ignored the vital geometric information embedded in coarse stages, leading to vulnerable cost matching and sub-optimal reconstruction results. In this paper, we propose a geometry-aware model, termed GeoMVSNet, to explicitly integrate geometric clues implied in coarse stages for delicate depth estimation. In particular, we design a two-branch geometry fusion network to extract geometric priors from coarse estimations to enhance structural feature extraction at finer stages. Besides, we embed the coarse probability volumes, which encode valuable depth distribution attributes, into the lightweight regularization network to further strengthen depth-wise geometry intuition. Meanwhile, we apply frequency domain filtering to mitigate the negative impact of high-frequency regions and adopt a curriculum learning strategy to progressively boost the geometry integration of the model. To intensify the full-scene geometry perception of our model, we present a depth distribution similarity loss based on the Gaussian-Mixture Model assumption. Extensive experiments on the DTU and Tanks and Temples (T&T) datasets demonstrate that GeoMVSNet achieves state-of-the-art results and ranks first on the T&T-Advanced set. Code is available at https://github.com/doubleZ0108/GeoMVSNet.
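The "narrowing hypothesis ranges" idea that cascade MVS methods share can be illustrated with a small sketch. This is not taken from the GeoMVSNet repository: the function name `narrow_hypotheses`, the `shrink` ratio, and the depth range are hypothetical values chosen only to show how a finer stage might re-centre a smaller set of depth hypotheses on the coarse estimate.

```python
def narrow_hypotheses(coarse_depth, prev_interval, num_hypotheses, shrink=0.5):
    """Return next-stage depth hypotheses centred on the coarse estimate.

    Assumption: each stage shrinks the spacing between hypotheses by a
    fixed ratio; real methods may choose the range differently.
    """
    interval = prev_interval * shrink                  # finer hypothesis spacing
    half_range = interval * (num_hypotheses - 1) / 2   # half-width of the new range
    start = coarse_depth - half_range
    return [start + i * interval for i in range(num_hypotheses)]


# Stage 1: 8 hypotheses spread uniformly over a hypothetical range [400, 1100].
depth_min, depth_max, n1 = 400.0, 1100.0, 8
interval1 = (depth_max - depth_min) / (n1 - 1)
stage1 = [depth_min + i * interval1 for i in range(n1)]

# Suppose the coarse stage selected a depth of 700.0 for some pixel.
# Stage 2 then searches 4 hypotheses in a much narrower band around it.
stage2 = narrow_hypotheses(700.0, interval1, num_hypotheses=4)
print(stage1)
print(stage2)
```

The second stage covers a range of 150 depth units instead of 700, which is why finer stages can afford higher image resolution with fewer hypotheses per pixel.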

Benchmarks

Benchmark | Methodology | Metrics
3d-reconstruction-on-dtu | GeoMVSNet | Acc: 0.331; Comp: 0.259; Overall: 0.295
point-clouds-on-tanks-and-temples | GeoMVSNet | Mean F1 (Advanced): 41.52; Mean F1 (Intermediate): 65.89
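On DTU, the Overall score is conventionally the arithmetic mean of accuracy (Acc) and completeness (Comp), and the figures listed here are consistent with that. A short check, with variable names chosen only for illustration:

```python
# DTU scores reported for GeoMVSNet (lower is better).
acc = 0.331
comp = 0.259

# Overall is the mean of accuracy and completeness.
overall = (acc + comp) / 2
print(round(overall, 3))
```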
