FovVideoVDP: A visible difference predictor for wide field-of-view video

Anjul Patney, Trisha Lian, Romain Bachy, Gizem Rufo, Anton Kaplanyan, Alexandre Chapiro, Gyorgy Denes, Rafał K. Mantiuk

Abstract

FovVideoVDP is a video difference metric that models the spatial, temporal, and peripheral aspects of perception. While many other metrics are available, our work provides the first practical treatment of these three central aspects of vision simultaneously. The complex interplay between spatial and temporal sensitivity across retinal locations is especially important for displays that cover a large field of view, such as Virtual and Augmented Reality displays, and for associated methods, such as foveated rendering. Our metric is derived from psychophysical studies of the early visual system, which model spatio-temporal contrast sensitivity, cortical magnification, and contrast masking. It accounts for the physical specifications of the display (luminance, size, resolution) and the viewing distance. To validate the metric, we collected a novel foveated rendering dataset that captures quality degradation due to sampling and reconstruction. To demonstrate our algorithm's generality, we test it on three independent foveated video datasets and on a large image quality dataset, achieving the best performance across all datasets when compared to the state of the art.
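The abstract notes that the metric depends on display size, resolution, and viewing distance. One common way such parameters are combined is into an angular resolution in pixels per visual degree. The sketch below shows that standard geometric conversion only; the function name `pixels_per_degree` and the example display values are assumptions made for illustration and are not taken from the paper or its code.

```python
import math

def pixels_per_degree(resolution_px, display_width_m, viewing_distance_m):
    """Approximate angular resolution at the display centre, in pixels per degree.

    Hypothetical helper: assumes a flat display viewed head-on.
    """
    # Physical width of a single pixel.
    pixel_pitch_m = display_width_m / resolution_px
    # Visual angle subtended by one pixel at the centre of the display.
    deg_per_pixel = math.degrees(
        2.0 * math.atan(0.5 * pixel_pitch_m / viewing_distance_m)
    )
    return 1.0 / deg_per_pixel

# Illustrative values only: a 3840-pixel-wide, 0.6 m wide display seen from 0.6 m.
example_ppd = pixels_per_degree(3840, 0.6, 0.6)
```

Moving the viewer farther away raises the pixels-per-degree value, which is why the same distortion can be visible up close and invisible from a distance.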

Benchmarks

Benchmark: video-quality-assessment-on-msu-video-quality-1
Methodology: FovVideoVDP
Metrics: KLCC 0.382, PLCC 0.607, SRCC 0.537
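The three benchmark figures are correlation coefficients between metric predictions and subjective quality scores. Assuming the usual reading of the abbreviations (PLCC as Pearson linear correlation, SRCC as Spearman rank correlation, KLCC as Kendall rank correlation), a minimal sketch of how such values can be computed follows. The data in the example are made up for illustration and are unrelated to the benchmark results above.

```python
import math

def pearson(x, y):
    """Pearson linear correlation coefficient (PLCC)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman(x, y):
    """Spearman rank correlation (SRCC): Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def kendall(x, y):
    """Kendall rank correlation (tau-b variant, which corrects for ties)."""
    concordant = discordant = only_x_tied = only_y_tied = 0
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            dx, dy = x[i] - x[j], y[i] - y[j]
            if dx == 0 and dy == 0:
                continue  # tied in both variables: excluded from all counts
            if dx == 0:
                only_x_tied += 1
            elif dy == 0:
                only_y_tied += 1
            elif dx * dy > 0:
                concordant += 1
            else:
                discordant += 1
    untied = concordant + discordant
    denom = math.sqrt((untied + only_x_tied) * (untied + only_y_tied))
    return (concordant - discordant) / denom

# Made-up metric predictions and subjective scores, for illustration only.
predicted = [1.0, 2.0, 3.0, 4.0, 5.0]
subjective = [1.2, 1.9, 3.5, 3.1, 5.0]
plcc = pearson(predicted, subjective)
srcc = spearman(predicted, subjective)
klcc = kendall(predicted, subjective)
```

PLCC rewards a linear relationship between predictions and scores, while SRCC and KLCC depend only on ordering, so a metric can rank stimuli well and still score lower on PLCC if its output scale is nonlinear.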
