HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Human Motion Diffusion as a Generative Prior

Yonatan Shafir Guy Tevet Roy Kapon Amit H. Bermano

Human Motion Diffusion as a Generative Prior

Abstract

Recent work has demonstrated the significant potential of denoising diffusion models for generating human motion, including text-to-motion capabilities. However, these methods are restricted by the paucity of annotated motion data, a focus on single-person motions, and a lack of detailed control. In this paper, we introduce three forms of composition based on diffusion priors: sequential, parallel, and model composition. Using sequential composition, we tackle the challenge of long sequence generation. We introduce DoubleTake, an inference-time method with which we generate long animations consisting of sequences of prompted intervals and their transitions, using a prior trained only for short clips. Using parallel composition, we show promising steps toward two-person generation. Beginning with two fixed priors as well as a few two-person training examples, we learn a slim communication block, ComMDM, to coordinate interaction between the two resulting motions. Lastly, using model composition, we first train individual priors to complete motions that realize a prescribed motion for a given joint. We then introduce DiffusionBlending, an interpolation mechanism to effectively blend several such models to enable flexible and efficient fine-grained joint and trajectory-level control and editing. We evaluate the composition methods using an off-the-shelf motion diffusion model, and further compare the results to dedicated models trained for these specific tasks.

Code Repositories

zhenzhiwang/intercontrol
pytorch
Mentioned in GitHub
priormdm/priormdm
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
motion-synthesis-on-inter-xComMDM
FID: 29.266
MMDist: 6.870
MModality: 0.771
R-Precision Top3: 0.236
motion-synthesis-on-interhumanComMDM
FID: 7.069
MMDist: 6.212
MModality: 1.822
R-Precision Top3: 0.466

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Human Motion Diffusion as a Generative Prior | Papers | HyperAI