Command Palette
Search for a command to run...
6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry
Chun Sungho ; Chang Ju Yong

Abstract
This study addresses the nuanced challenge of estimating head translationswithin the context of six-degrees-of-freedom (6DoF) head pose estimation,placing emphasis on this aspect over the more commonly studied head rotations.Identifying a gap in existing methodologies, we recognized the underutilizedpotential synergy between facial geometry and head translation. To bridge thisgap, we propose a novel approach called the head Translation, Rotation, andface Geometry network (TRG), which stands out for its explicit bidirectionalinteraction structure. This structure has been carefully designed to leveragethe complementary relationship between face geometry and head translation,marking a significant advancement in the field of head pose estimation. Ourcontributions also include the development of a strategy for estimatingbounding box correction parameters and a technique for aligning landmarks toimage. Both of these innovations demonstrate superior performance in 6DoF headpose estimation tasks. Extensive experiments conducted on ARKitFace and BIWIdatasets confirm that the proposed method outperforms current state-of-the-arttechniques. Codes are released at https://github.com/asw91666/TRG-Release.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| head-pose-estimation-on-arkitface | TRG (w/ 300WLP) | MAE(val): 0.91 MAE_t: 3.62 |
| head-pose-estimation-on-arkitface | TRG | MAE(val): 0.92 MAE_t: 3.64 |
| head-pose-estimation-on-biwi | TRG (w/ 300WLP) | Geodesic Error - aligned (GE): 5.68 MAE (trained with other data): 2.75 MAE_t: 12.97 |
| head-pose-estimation-on-biwi | TRG | Geodesic Error - aligned (GE): 5.35 MAE (trained with other data): 2.89 MAE_t: 14.31 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.