HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Shape Preserving Facial Landmarks with Graph Attention Networks

Prados-Torreblanca Andrés ; Buenaposada José M. ; Baumela Luis

Shape Preserving Facial Landmarks with Graph Attention Networks

Abstract

Top-performing landmark estimation algorithms are based on exploiting theexcellent ability of large convolutional neural networks (CNNs) to representlocal appearance. However, it is well known that they can only learn weakspatial relationships. To address this problem, we propose a model based on thecombination of a CNN with a cascade of Graph Attention Network regressors. Tothis end, we introduce an encoding that jointly represents the appearance andlocation of facial landmarks and an attention mechanism to weigh theinformation according to its reliability. This is combined with a multi-taskapproach to initialize the location of graph nodes and a coarse-to-finelandmark description scheme. Our experiments confirm that the proposed modellearns a global representation of the structure of the face, achieving topperformance in popular benchmarks on head pose and landmark estimation. Theimprovement provided by our model is most significant in situations involvinglarge changes in the local appearance of landmarks.

Code Repositories

andresprados/spiga
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
face-alignment-on-300wSPIGA
NME_inter-ocular (%, Challenge): 4.66
NME_inter-ocular (%, Common): 2.59
NME_inter-ocular (%, Full): 2.99
NME_inter-pupil (%, Challenge): 6.73
NME_inter-pupil (%, Common): 3.59
NME_inter-pupil (%, Full): 4.20
face-alignment-on-300w-commonSPIGA
NME: 2.59
face-alignment-on-300w-split-2SPIGA
AUC@7 (box): 71.0
AUC@8 (inter-ocular): 57.27
FR@8 (inter-ocular): 0.67
NME (box): 2.03
NME (inter-ocular): 3.43
face-alignment-on-cofw-68SPIGA
AUC@7 (box): 64.1
NME (box): 2.52
NME (inter-ocular): 3.93
face-alignment-on-merl-ravSPIGA
AUC@7 (box) : 78.47
NME (box): 1.51
face-alignment-on-wflwSPIGA
AUC@10 (inter-ocular): 60.56
FR@10 (inter-ocular): 2.08
NME (inter-ocular): 4.06
face-alignment-on-wfw-extra-dataSPIGA
AUC@10 (inter-ocular): 60.56
FR@10 (inter-ocular): 2.08
NME (inter-ocular): 4.06
facial-landmark-detection-on-300wSPIGA (Inter-ocular Norm)
NME: 2.99
head-pose-estimation-on-wflwSPIGA
MAE mean (º): 1.52
MAE pitch (º): 1.86
MAE roll (º): 0.93
MAE yaw (º): 1.78
pose-estimation-on-300w-fullSPIGA
MAE mean (º): 1.29
MAE pitch (º): 1.70
MAE roll (º): 0.77
MAE yaw (º): 1.41
pose-estimation-on-merl-ravSPIGA
MAE mean (º): 2.39
MAE pitch (º): 2.24
MAE roll (º): 1.71
MAE yaw (º): 3.23

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Shape Preserving Facial Landmarks with Graph Attention Networks | Papers | HyperAI