DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation

Zeng Ailing; Ju Xuan; Yang Lei; Gao Ruiyuan; Zhu Xizhou; Dai Bo; Xu Qiang

Abstract

This paper proposes DeciWatch, a simple baseline framework for video-based 2D/3D human pose estimation that can achieve a 10x efficiency improvement over existing works without any performance degradation. Unlike current solutions that estimate each frame in a video, DeciWatch introduces a simple yet effective sample-denoise-recover framework that only watches sparsely sampled frames, taking advantage of the continuity of human motions and the lightweight pose representation. Specifically, DeciWatch uniformly samples fewer than 10% of the video frames for detailed estimation, denoises the estimated 2D/3D poses with an efficient Transformer architecture, and then accurately recovers the rest of the frames using another Transformer-based network. Comprehensive experimental results on three video-based human pose estimation and body mesh recovery tasks with four datasets validate the efficiency and effectiveness of DeciWatch. Code is available at https://github.com/cure-lab/DeciWatch.
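
The sample-then-recover idea can be illustrated with a toy sketch. This is not the authors' implementation: the Transformer-based denoising and recovery networks are replaced here by plain linear interpolation, the denoising step is omitted entirely, and the names `sample_indices` and `recover_linear` are hypothetical. It only shows why sparse sampling can work when motion between sampled frames is smooth.

```python
def sample_indices(num_frames, interval=10):
    # Uniform sampling: keep frames 0, interval, 2*interval, ...
    return list(range(0, num_frames, interval))


def recover_linear(sampled_idx, sampled_poses, num_frames):
    # Stand-in for the recovery network (an assumption, not the paper's
    # method): linearly interpolate between neighbouring sampled poses.
    # Frames after the last sampled frame reuse its pose.
    recovered = []
    seg = 0  # position of the sampled frame at or before frame t
    for t in range(num_frames):
        while seg + 1 < len(sampled_idx) and sampled_idx[seg + 1] <= t:
            seg += 1
        if seg + 1 == len(sampled_idx):
            recovered.append(list(sampled_poses[seg]))
            continue
        t0, t1 = sampled_idx[seg], sampled_idx[seg + 1]
        w = (t - t0) / (t1 - t0)
        p0, p1 = sampled_poses[seg], sampled_poses[seg + 1]
        recovered.append([(1 - w) * a + w * b for a, b in zip(p0, p1)])
    return recovered


# Toy "video": 21 frames, one 2D joint moving at constant speed along x.
full = [[float(t), 0.0] for t in range(21)]
idx = sample_indices(len(full), interval=10)   # frames 0, 10 and 20
sparse = [full[i] for i in idx]                # poses a per-frame estimator would output
dense = recover_linear(idx, sparse, len(full))
```

With an interval of 10, only the sampled frames would be passed to the expensive per-frame pose estimator; the remaining frames are filled in from the lightweight pose sequence alone. Linear interpolation recovers this toy motion exactly only because it is linear, which real human motion is not.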

Code Repositories

cure-lab/DeciWatch (Official, PyTorch, mentioned in GitHub)

Benchmarks

| Benchmark | Methodology | Metrics |
| --- | --- | --- |
| 2d-human-pose-estimation-on-jhmdb-2d-poses | DeciWatch | PCK: 98.8 |
| 3d-human-pose-estimation-on-3dpw | DeciWatch-PARE | MPJPE: 75.5; PA-MPJPE: 46.4 |
| 3d-human-pose-estimation-on-aist | DeciWatch | MPJPE: 67.2; Single-view: Y |
| 3d-human-pose-estimation-on-human36m | DeciWatch | Average MPJPE (mm): 53.1 |
| pose-estimation-on-j-hmdb | DeciWatch | Mean PCK@0.05: 80.6; Mean PCK@0.1: 94.6; Mean PCK@0.2: 99.0 |
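
For readers unfamiliar with the metrics listed above, the following is a minimal sketch of how MPJPE and PCK are commonly computed for a single frame. It is not taken from the DeciWatch code. The function names are hypothetical, and the reference length used to normalize PCK (`ref_length`) is an assumption, since its definition varies by dataset protocol. PA-MPJPE additionally aligns the prediction to the ground truth with a Procrustes transform before measuring the error, which is not shown here.

```python
import math


def mpjpe(pred, gt):
    # Mean per-joint position error: average Euclidean distance
    # between predicted and ground-truth joints (lower is better).
    return sum(math.dist(p, g) for p, g in zip(pred, gt)) / len(gt)


def pck(pred, gt, threshold, ref_length):
    # Percentage of correct keypoints: a joint counts as correct if its
    # error is within threshold * ref_length (higher is better).
    # ref_length is an assumed normalizer, e.g. a bounding-box size.
    limit = threshold * ref_length
    hits = sum(math.dist(p, g) <= limit for p, g in zip(pred, gt))
    return 100.0 * hits / len(gt)


# Toy 3D example in millimetres: joint errors of 5 mm and 12 mm.
gt_3d = [(0.0, 0.0, 0.0), (100.0, 0.0, 0.0)]
pred_3d = [(3.0, 4.0, 0.0), (100.0, 0.0, 12.0)]
error_mm = mpjpe(pred_3d, gt_3d)

# Toy 2D example in pixels: joint errors of 1 px and 5 px.
gt_2d = [(0.0, 0.0), (10.0, 0.0)]
pred_2d = [(1.0, 0.0), (10.0, 5.0)]
pck_tight = pck(pred_2d, gt_2d, threshold=0.1, ref_length=20.0)
pck_loose = pck(pred_2d, gt_2d, threshold=0.5, ref_length=20.0)
```

A tighter threshold such as PCK@0.05 is stricter than PCK@0.2, which is consistent with the scores in the table rising as the threshold grows.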
