5 months ago

Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models

Liu Zhibin; Dong Haoye; Chharia Aviral; Wu Hefeng

Abstract

Generating lifelike 3D humans from a single RGB image remains a challenging task in computer vision, as it requires accurate modeling of geometry, high-quality texture, and plausible unseen parts. Existing methods typically use multi-view diffusion models for 3D generation, but they often suffer from inconsistent views, which hinders high-quality 3D human generation. To address this, we propose Human-VDM, a novel method for generating 3D humans from a single RGB image using Video Diffusion Models. Human-VDM provides temporally consistent views for 3D human generation using Gaussian Splatting. It consists of three modules: a view-consistent human video diffusion module, a video augmentation module, and a Gaussian Splatting module. First, a single image is fed into the human video diffusion module to generate a coherent human video. Next, the video augmentation module applies super-resolution and video interpolation to enhance the textures and geometric smoothness of the generated video. Finally, the 3D Human Gaussian Splatting module learns lifelike humans under the guidance of these high-resolution and view-consistent images. Experiments demonstrate that Human-VDM generates high-quality 3D humans from a single image, outperforming state-of-the-art methods in both generation quality and quantity. Project page: https://human-vdm.github.io/Human-VDM/
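
The three-module flow described in the abstract can be sketched as a toy data pipeline. This is only an illustration of the stage ordering, not the authors' implementation: every name here (`Frame`, `generate_video`, `augment_video`, `fit_gaussians`, `run_pipeline`), the azimuth-based camera layout, and the frame counts are hypothetical, and each stage is a trivial stand-in (frame copying, nearest-neighbour upscaling, linear blending) for the learned model the paper uses.

```python
from dataclasses import dataclass
from typing import List

# Toy stand-in for an image: rows of grayscale pixel values.
Image = List[List[float]]


@dataclass
class Frame:
    azimuth_deg: float  # assumed camera angle around the subject (hypothetical)
    pixels: Image


def generate_video(image: Image, num_frames: int) -> List[Frame]:
    """Stage 1 placeholder. A real video diffusion model would synthesize
    novel views; here every frame simply copies the input image."""
    step = 360.0 / num_frames
    return [Frame(i * step, [row[:] for row in image]) for i in range(num_frames)]


def upscale(image: Image, factor: int) -> Image:
    """Nearest-neighbour upscaling, standing in for learned super-resolution."""
    out: Image = []
    for row in image:
        wide = [p for p in row for _ in range(factor)]
        out.extend(wide[:] for _ in range(factor))
    return out


def interpolate(frames: List[Frame]) -> List[Frame]:
    """Insert one linearly blended frame between each consecutive pair,
    standing in for learned video interpolation."""
    result: List[Frame] = []
    for a, b in zip(frames, frames[1:]):
        mid = [[(x + y) / 2 for x, y in zip(ra, rb)]
               for ra, rb in zip(a.pixels, b.pixels)]
        result.append(a)
        result.append(Frame((a.azimuth_deg + b.azimuth_deg) / 2, mid))
    result.append(frames[-1])
    return result


def augment_video(frames: List[Frame], sr_factor: int) -> List[Frame]:
    """Stage 2: super-resolution followed by frame interpolation."""
    upscaled = [Frame(f.azimuth_deg, upscale(f.pixels, sr_factor)) for f in frames]
    return interpolate(upscaled)


def fit_gaussians(frames: List[Frame]) -> dict:
    """Stage 3 placeholder. A real module would optimize 3D Gaussians against
    the frames; here we only report what supervision it would receive."""
    height = len(frames[0].pixels)
    width = len(frames[0].pixels[0])
    return {"num_views": len(frames), "resolution": (height, width)}


def run_pipeline(image: Image, num_frames: int = 4, sr_factor: int = 2) -> dict:
    video = generate_video(image, num_frames)
    enhanced = augment_video(video, sr_factor)
    return fit_gaussians(enhanced)
```

Calling `run_pipeline([[0.0, 1.0], [1.0, 0.0]])` passes a 2x2 toy image through all three stages; the point of the sketch is that the final stage sees more views at higher resolution than the first stage produced.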

Code Repositories

Human-VDM/Human-VDM
Official
Mentioned in GitHub

Benchmarks

Benchmark: lifelike-3d-human-generation-on-thuman2-0
Methodology: Human-VDM
Metrics:
CLIP Similarity: 0.9235
LPIPS: 0.0957
PSNR: 20.068
SSIM: 0.9228
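
For readers unfamiliar with PSNR, the standard definition is 10 * log10(MAX^2 / MSE), reported in dB. The sketch below computes it for two small images; the function name `psnr` and the assumption that pixel values lie in [0, 1] are illustrative choices, and the page does not state how the benchmark's evaluation was actually configured.

```python
import math
from typing import List

Image = List[List[float]]


def psnr(reference: Image, rendered: Image, max_value: float = 1.0) -> float:
    """Peak signal-to-noise ratio in dB between two same-sized images.
    max_value is the largest possible pixel value (assumed 1.0 here)."""
    squared_errors = [
        (a - b) ** 2
        for ref_row, ren_row in zip(reference, rendered)
        for a, b in zip(ref_row, ren_row)
    ]
    mse = sum(squared_errors) / len(squared_errors)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Under this definition, a PSNR of about 20 dB corresponds to a mean squared error of roughly 0.01 when pixel values are scaled to [0, 1].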
