HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Human Motion Diffusion Model

Guy Tevet Sigal Raab Brian Gordon Yonatan Shafir Daniel Cohen-Or Amit H. Bermano

Human Motion Diffusion Model

Abstract

Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it. Therefore, current generative solutions are either low-quality or limited in expressiveness. Diffusion models, which have already shown remarkable generative capabilities in other domains, are promising candidates for human motion due to their many-to-many nature, but they tend to be resource hungry and hard to control. In this paper, we introduce Motion Diffusion Model (MDM), a carefully adapted classifier-free diffusion-based generative model for the human motion domain. MDM is transformer-based, combining insights from motion generation literature. A notable design-choice is the prediction of the sample, rather than the noise, in each diffusion step. This facilitates the use of established geometric losses on the locations and velocities of the motion, such as the foot contact loss. As we demonstrate, MDM is a generic approach, enabling different modes of conditioning, and different generation tasks. We show that our model is trained with lightweight resources and yet achieves state-of-the-art results on leading benchmarks for text-to-motion and action-to-motion. https://guytevet.github.io/mdm-page/ .

Code Repositories

guytevet/motion-diffusion-model
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
3d-generation-on-e-t-the-exceptionalMDM
ClaTr-Score: 18.32
Classifier-F1: 0.34
FD_ClaTr: 6.79
motion-synthesis-on-humanact12MDM
Accuracy: 0.99
FID: 0.08
Multimodality: 2.58
motion-synthesis-on-humanml3dMDM
Diversity: 9.559
FID: 0.544
Multimodality: 2.799
R Precision Top3: 0.611
motion-synthesis-on-inter-xMDM
FID: 23.701
MMDist: 9.548
MModality: 3.490
R-Precision Top3: 0.426
motion-synthesis-on-interhumanMDM
FID: 9.167
MMDist: 7.125
MModality: 2.35
R-Precision Top3: 0.339
motion-synthesis-on-kit-motion-languageMDM
Diversity: 10.847
FID: 0.497
Multimodality: 1.907
R Precision Top3: 0.396
motion-synthesis-on-motion-xMDM
Diversity: 11.400
FID: 3.800
MModality: 2.530
TMR-Matching Score: 0.840
TMR-R-Precision Top3: 0.6341

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Human Motion Diffusion Model | Papers | HyperAI