HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Diverse Video Generation using a Gaussian Process Trigger

Gaurav Shrivastava Abhinav Shrivastava

Diverse Video Generation using a Gaussian Process Trigger

Abstract

Generating future frames given a few context (or past) frames is a challenging task. It requires modeling the temporal coherence of videos and multi-modality in terms of diversity in the potential future states. Current variational approaches for video generation tend to marginalize over multi-modal future outcomes. Instead, we propose to explicitly model the multi-modality in the future outcomes and leverage it to sample diverse futures. Our approach, Diverse Video Generator, uses a Gaussian Process (GP) to learn priors on future states given the past and maintains a probability distribution over possible futures given a particular sample. In addition, we leverage the changes in this distribution over time to control the sampling of diverse future states by estimating the end of ongoing sequences. That is, we use the variance of GP over the output function space to trigger a change in an action sequence. We achieve state-of-the-art results on diverse future frame generation in terms of reconstruction quality and diversity of the generated sequences.

Code Repositories

shgaurav1/DVG
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
video-prediction-on-bair-robot-pushing-1DVG
FVD: 120.03
video-prediction-on-kthDVG
Diversity: 0.483

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Diverse Video Generation using a Gaussian Process Trigger | Papers | HyperAI