HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

A generic diffusion-based approach for 3D human pose prediction in the wild

Saeed Saadatnejad Ali Rasekh Mohammadreza Mofayezi Yasamin Medghalchi Sara Rajabzadeh Taylor Mordan Alexandre Alahi

A generic diffusion-based approach for 3D human pose prediction in the wild

Abstract

Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a diffusion-based approach that can predict given noisy observations. We frame the prediction task as a denoising problem, where both observation and prediction are considered as a single sequence containing missing elements (whether in the observation or prediction horizon). All missing elements are treated as noise and denoised with our conditional diffusion model. To better handle long-term forecasting horizon, we present a temporal cascaded diffusion model. We demonstrate the benefits of our approach on four publicly available datasets (Human3.6M, HumanEva-I, AMASS, and 3DPW), outperforming the state-of-the-art. Additionally, we show that our framework is generic enough to improve any 3D pose prediction model as a pre-processing step to repair their inputs and a post-processing step to refine their outputs. The code is available online: \url{https://github.com/vita-epfl/DePOSit}.

Code Repositories

vita-epfl/deposit
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
human-pose-forecasting-on-3dpwTCD
FDE@1000ms (mm): 73.4
FDE@560ms (mm): 55.4
FDE@720ms (mm): 61.6
FDE@880ms (mm): 67.9
human-pose-forecasting-on-amassTCD
FDE@1000ms (mm): 66.7
FDE@560ms (mm): 49.8
FDE@720ms (mm): 54.5
FDE@880ms (mm): 60.1
human-pose-forecasting-on-human36mTCD
ADE: 356
APD: 19466
FDE: 396
MMADE: 463
MMFDE: 445
human-pose-forecasting-on-humaneva-iTCD
ADE@2000ms: 199
APD@2000ms: 6764
FDE@2000ms: 215

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A generic diffusion-based approach for 3D human pose prediction in the wild | Papers | HyperAI