HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

Abstract

Video generation has increasingly gained interest in both academia and industry. Although commercial tools can generate plausible videos, there is a limited number of open-source models available for researchers and engineers. In this work, we introduce two diffusion models for high-quality video generation, namely text-to-video (T2V) and image-to-video (I2V) models. T2V models synthesize a video based on a given text input, while I2V models incorporate an additional image input. Our proposed T2V model can generate realistic and cinematic-quality videos with a resolution of $1024 \times 576$, outperforming other open-source T2V models in terms of quality. The I2V model is designed to produce videos that strictly adhere to the content of the provided reference image, preserving its content, structure, and style. This model is the first open-source I2V foundation model capable of transforming a given image into a video clip while maintaining content preservation constraints. We believe that these open-source video generation models will contribute significantly to the technological advancements within the community.

Code Repositories

invictus717/interactivevideo
pytorch
Mentioned in GitHub
ailab-cvc/videocrafter
Official
pytorch
Mentioned in GitHub
videocrafter/videocrafter
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
text-to-video-generation-on-evalcrafter-textVideoCrafter1
Motion Quality: 60.85
Temporal Consistency: 55.89
Text-to-Video Alignment: 61.95
Total Score: 232
Visual Quality: 53.08

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation | Papers | HyperAI