Command Palette
Search for a command to run...
{Eraldo Ribeiro Nima Aghli}
Abstract
Head-pose estimation from images is an important research topic in computer vision. Its many applications include detecting focus of attention, tracking driver behavior, and human-computer interaction. Recent research on head-pose estimation has focused on developing models based on deep convolutional neural networks (CNNs). These models are trained using transfer-learning and image augmentation to achieve better initiation states and robustness against occlusions. However, methods that use transfer-learning networks are usually aimed at general image recognition and offer no in-depth study of transfer learning from more task-related networks. Additionally, for the head-pose estimation, robustness against heavy occlusion, and noise such as motion blur and low-brightness are vital. In this paper, we propose a new image-augmentation approach that significantly improves the estimation accuracy of the head-pose model. We also propose a task-related weight initialization to further improve the estimation accuracy by studying internal activations of models trained for face-related tasks such as face-recognition. We test our head-pose estimation model on three challenging test sets and achieve better results to state-of-the-art methods.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| head-pose-estimation-on-aflw2000 | DDD-Pose | MAE: 4.22 |
| head-pose-estimation-on-biwi | DDD-Pose | MAE (trained with BIWI data): 2.80 MAE (trained with other data): 4.52 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.