5 months ago

AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation

Abstract

Expressive human pose and shape estimation (a.k.a. 3D whole-body mesh recovery) involves human body, hand, and expression estimation. Most existing methods tackle this task in a two-stage manner, first detecting the human body parts with an off-the-shelf detection model and then inferring the different body parts individually. Despite the impressive results achieved, these methods suffer from 1) loss of valuable contextual information via cropping, 2) the introduction of distractions, and 3) a lack of inter-association among different persons and body parts, inevitably causing performance degradation, especially in crowded scenes. To address these issues, we introduce a novel all-in-one-stage framework, AiOS, for multi-person expressive human pose and shape recovery without an additional human detection step. Specifically, our method is built upon DETR and treats the multi-person whole-body mesh recovery task as a progressive set prediction problem with various sequential detections. We devise the decoder tokens and extend them to our task. We first employ a human token to probe a human location in the image and encode global features for each instance, which provides a coarse location for the later transformer block. Then, we introduce a joint-related token to probe the human joints in the image and encode a fine-grained local feature, which collaborates with the global feature to regress the whole-body mesh. This straightforward but effective model outperforms previous state-of-the-art methods by a 9% reduction in NMVE on AGORA, a 30% reduction in PVE on EHF, a 10% reduction in PVE on ARCTIC, and a 3% reduction in PVE on EgoBody.
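The abstract frames detection as set prediction in the style of DETR, where each ground-truth person is paired one-to-one with a predicted query during training. The sketch below illustrates that pairing step only, under loud assumptions: the function name `match_predictions`, the use of 2D person centers, and the plain Euclidean cost are all hypothetical choices made for illustration, and the brute-force search stands in for the Hungarian algorithm. It is not the matching cost or code used by AiOS.

```python
from itertools import permutations
from math import dist


def match_predictions(pred_centers, gt_centers):
    """Find the minimum-cost one-to-one assignment of predictions to ground truths.

    Hypothetical helper: brute force over all assignments, which is only
    practical for a handful of people. Returns (pairs, total_cost), where
    pairs is a list of (pred_index, gt_index) in ground-truth order.
    """
    n_pred, n_gt = len(pred_centers), len(gt_centers)
    if n_gt > n_pred:
        raise ValueError("need at least as many predictions as ground truths")

    best_cost = float("inf")
    best_perm = ()
    # Each candidate picks a distinct prediction for every ground-truth person.
    for perm in permutations(range(n_pred), n_gt):
        cost = sum(
            dist(pred_centers[p], gt_centers[g])  # assumed cost: center distance
            for g, p in enumerate(perm)
        )
        if cost < best_cost:
            best_cost, best_perm = cost, perm

    pairs = [(p, g) for g, p in enumerate(best_perm)]
    return pairs, best_cost


# Illustrative, made-up normalized image coordinates (not data from the paper).
preds = [(0.9, 0.9), (0.1, 0.1), (0.5, 0.5)]
gts = [(0.0, 0.0), (1.0, 1.0)]
pairs, total_cost = match_predictions(preds, gts)
matched = {p for p, _ in pairs}
unmatched = [i for i in range(len(preds)) if i not in matched]
print(pairs, unmatched)
```

In a DETR-style setup, predictions left unmatched (index 2 here) would typically be supervised as "no person", which is what removes the need for a separate detection stage.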

Code Repositories

ttxskk/AiOS (official implementation, PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
3d-multi-person-mesh-recovery-on-agora | AIOS | FB-NMVE: 97.8
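For readers unfamiliar with the metrics named here, the following is a minimal sketch of how a per-vertex error, a normalized error, and a relative reduction can be computed. It assumes that PVE is the mean Euclidean distance between corresponding predicted and ground-truth mesh vertices, and that NMVE divides the mean vertex error by the detection F1 score, as commonly described for the AGORA benchmark. The function names and all numbers are hypothetical illustrations, not values from the paper or the benchmark's evaluation code.

```python
from math import dist


def mean_vertex_error(pred_vertices, gt_vertices):
    """Mean Euclidean distance between corresponding 3D vertices."""
    if not pred_vertices or len(pred_vertices) != len(gt_vertices):
        raise ValueError("vertex lists must be non-empty and the same length")
    total = sum(dist(p, g) for p, g in zip(pred_vertices, gt_vertices))
    return total / len(pred_vertices)


def normalized_error(error, f1_score):
    """Assumed NMVE-style normalization: penalize missed or spurious detections."""
    if not 0.0 < f1_score <= 1.0:
        raise ValueError("F1 score must be in (0, 1]")
    return error / f1_score


def relative_reduction(baseline, new):
    """Fractional improvement of `new` over `baseline` (lower is better)."""
    return (baseline - new) / baseline


# Made-up toy mesh with two vertices; units are arbitrary here.
pred = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
gt = [(0.0, 0.0, 3.0), (1.0, 4.0, 0.0)]
pve = mean_vertex_error(pred, gt)           # per-vertex distances are 3 and 4
nmve = normalized_error(90.0, 0.9)          # made-up error and F1 values
gain = relative_reduction(100.0, 70.0)      # made-up baseline vs. new error
print(pve, nmve, gain)
```

Because the normalized form divides by F1, a method that detects fewer people correctly is penalized even when its per-person mesh error is low.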
