HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Yijun Zhou James Gregson

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Abstract

We present an end-to-end head-pose estimation network designed to predict Euler angles through the full range head yaws from a single RGB image. Existing methods perform well for frontal views but few target head pose from all viewpoints. This has applications in autonomous driving and retail. Our network builds on multi-loss approaches with changes to loss functions and training strategies adapted to wide range estimation. Additionally, we extract ground truth labelings of anterior views from a current panoptic dataset for the first time. The resulting Wide Headpose Estimation Network (WHENet) is the first fine-grained modern method applicable to the full-range of head yaws (hence wide) yet also meets or beats state-of-the-art methods for frontal head pose estimation. Our network is compact and efficient for mobile devices and applications.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
head-pose-estimation-on-aflw2000WHENet-V
MAE: 4.83
head-pose-estimation-on-aflw2000WHENet
MAE: 5.42
head-pose-estimation-on-biwiWHENet
MAE (trained with other data): 3.81
head-pose-estimation-on-biwiWHENet-V
MAE (trained with other data): 3.48
head-pose-estimation-on-panopticWHENET
Geodesic Error (GE): 24.38

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose | Papers | HyperAI