HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction

Sibo Tian Minghui Zheng Xiao Liang

TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction

Abstract

Predicting human motion plays a crucial role in ensuring a safe and effective human-robot close collaboration in intelligent remanufacturing systems of the future. Existing works can be categorized into two groups: those focusing on accuracy, predicting a single future motion, and those generating diverse predictions based on observations. The former group fails to address the uncertainty and multi-modal nature of human motion, while the latter group often produces motion sequences that deviate too far from the ground truth or become unrealistic within historical contexts. To tackle these issues, we propose TransFusion, an innovative and practical diffusion-based model for 3D human motion prediction which can generate samples that are more likely to happen while maintaining a certain level of diversity. Our model leverages Transformer as the backbone with long skip connections between shallow and deep layers. Additionally, we employ the discrete cosine transform to model motion sequences in the frequency space, thereby improving performance. In contrast to prior diffusion-based models that utilize extra modules like cross-attention and adaptive layer normalization to condition the prediction on past observed motion, we treat all inputs, including conditions, as tokens to create a more lightweight model compared to existing approaches. Extensive experimental studies are conducted on benchmark datasets to validate the effectiveness of our human motion prediction model.

Code Repositories

sibotian96/TransFusion
Official
pytorch

Benchmarks

BenchmarkMethodologyMetrics
human-pose-forecasting-on-amassTransFusion
ADE: 0.508
APD: 8.853
FDE: 0.568
human-pose-forecasting-on-human36mTransFusion
ADE: 358
APD: 5975
FDE: 468
MMADE: 506
MMFDE: 539
human-pose-forecasting-on-humaneva-iTransFusion
ADE@2000ms: 204
APD@2000ms: 1031
FDE@2000ms: 234
MMADE@2000ms: 408
MMFDE@2000ms: 427

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
TransFusion: A Practical and Effective Transformer-based Diffusion Model for 3D Human Motion Prediction | Papers | HyperAI