HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Learning to Estimate Hidden Motions with Global Motion Aggregation

Shihao Jiang Dylan Campbell Yao Lu Hongdong Li Richard Hartley

Learning to Estimate Hidden Motions with Global Motion Aggregation

Abstract

Occlusions pose a significant challenge to optical flow algorithms that rely on local evidences. We consider an occluded point to be one that is imaged in the first frame but not in the next, a slight overloading of the standard definition since it also includes points that move out-of-frame. Estimating the motion of these points is extremely difficult, particularly in the two-frame setting. Previous work relies on CNNs to learn occlusions, without much success, or requires multiple frames to reason about occlusions using temporal smoothness. In this paper, we argue that the occlusion problem can be better solved in the two-frame case by modelling image self-similarities. We introduce a global motion aggregation module, a transformer-based approach to find long-range dependencies between pixels in the first image, and perform global aggregation on the corresponding motion features. We demonstrate that the optical flow estimates in the occluded regions can be significantly improved without damaging the performance in non-occluded regions. This approach obtains new state-of-the-art results on the challenging Sintel dataset, improving the average end-point error by 13.6% on Sintel Final and 13.7% on Sintel Clean. At the time of submission, our method ranks first on these benchmarks among all published and unpublished approaches. Code is available at https://github.com/zacjiang/GMA

Code Repositories

zacjiang/GMA
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
optical-flow-estimation-on-kitti-2015-trainGMA
EPE: 4.69
F1-all: 17.1
optical-flow-estimation-on-sintel-cleanGMA
Average End-Point Error: 1.388
optical-flow-estimation-on-sintel-finalGMA
Average End-Point Error: 2.470
optical-flow-estimation-on-springGMA
1px total: 7.074

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Learning to Estimate Hidden Motions with Global Motion Aggregation | Papers | HyperAI