Using Self-Supervised Auxiliary Tasks to Improve Fine-Grained Facial Representation

Mahdi Pourmirzaei Gholam Ali Montazer Farzaneh Esmaili

Abstract

This paper first investigates the impact of ImageNet pre-training on fine-grained Facial Emotion Recognition (FER). The results show that, when sufficient augmentation is applied to the images, training from scratch yields better results than fine-tuning an ImageNet pre-trained model. Next, we propose a method to improve fine-grained and in-the-wild FER, called Hybrid Multi-Task Learning (HMTL). HMTL uses Self-Supervised Learning (SSL) as an auxiliary task during classical Supervised Learning (SL), in the form of Multi-Task Learning (MTL). Leveraging SSL during training extracts additional information from the images for the primary fine-grained SL task. We investigate how the proposed HMTL can be used in the FER domain by designing customized versions of two common pretext task techniques, puzzling and in-painting. We achieve state-of-the-art results on the AffectNet benchmark with two types of HMTL, without pre-training on additional data. Experimental results comparing common SSL pre-training with the proposed HMTL demonstrate the difference and the superiority of our work. HMTL is not limited to the FER domain, however. Experiments on two other fine-grained facial tasks, head pose estimation and gender recognition, reveal the potential of HMTL to improve fine-grained facial representation.
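
As a rough illustration of the idea described in the abstract, the sketch below shows a generic jigsaw-style ("puzzling") pretext task and a weighted sum of a supervised loss and an auxiliary SSL loss. It is not the authors' customized puzzling method or their loss formulation, neither of which is specified on this page. The function names `make_puzzle` and `hybrid_loss`, the tile grid, and the `aux_weight` parameter are all assumptions made for illustration.

```python
import random


def make_puzzle(image, grid, rng):
    """Shuffle the tiles of a square 2D image (a list of equal-length rows).

    Returns (shuffled_image, perm), where perm[k] is the index of the
    original tile (row-major order) now placed at position k. The
    permutation can serve as the self-supervised target.
    Hypothetical helper, not taken from the paper.
    """
    size = len(image)
    if size % grid != 0 or any(len(row) != size for row in image):
        raise ValueError("image must be square and divisible by grid")
    tile = size // grid

    # Cut the image into grid x grid tiles, in row-major order.
    tiles = []
    for r in range(grid):
        for c in range(grid):
            rows = image[r * tile:(r + 1) * tile]
            tiles.append([row[c * tile:(c + 1) * tile] for row in rows])

    # Draw a random tile permutation.
    perm = list(range(grid * grid))
    rng.shuffle(perm)

    # Reassemble the image with the tiles in permuted positions.
    shuffled = []
    for r in range(grid):
        for y in range(tile):
            row = []
            for c in range(grid):
                row.extend(tiles[perm[r * grid + c]][y])
            shuffled.append(row)
    return shuffled, perm


def hybrid_loss(supervised_loss, auxiliary_loss, aux_weight=0.5):
    """Combine the primary SL loss with the auxiliary SSL loss.

    A plain weighted sum is assumed here; the weighting actually used
    in the paper is not given in this abstract.
    """
    return supervised_loss + aux_weight * auxiliary_loss


if __name__ == "__main__":
    image = [[4 * r + c for c in range(4)] for r in range(4)]
    shuffled, perm = make_puzzle(image, grid=2, rng=random.Random(0))
    print(perm)
    print(hybrid_loss(1.0, 2.0, aux_weight=0.5))
```

In a multi-task setup of this kind, a shared backbone would feed two heads, one predicting the emotion label and one solving the pretext task, and both losses would be back-propagated together during a single training run rather than in separate pre-training and fine-tuning stages.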

Benchmarks

Benchmark                                    Methodology                                    Metrics
facial-expression-recognition-on-affectnet   SL (B0)                                        Accuracy (8 emotion): 60.34
facial-expression-recognition-on-affectnet   SL + SSL puzzling (B0)                         Accuracy (8 emotion): 61.09
facial-expression-recognition-on-affectnet   SL + SSL in-painting-pl + 20% train (B0)       Accuracy (8 emotion): 55.36
facial-expression-recognition-on-affectnet   SL + SSL in-painting-pl (B0)                   Accuracy (8 emotion): 61.72
facial-expression-recognition-on-affectnet   SL + SSL puzzling (B2)                         Accuracy (8 emotion): 61.32
facial-expression-recognition-on-affectnet   SL + 20% train (B0)                            Accuracy (8 emotion): 52.46
facial-expression-recognition-on-affectnet   SL (B2)                                        Accuracy (8 emotion): 60.35
facial-expression-recognition-on-affectnet   SL + SSL puzzling + 20% train (B0)             Accuracy (8 emotion): 54.98
facial-expression-recognition-on-ck          Nonlinear eval on SL + SSL puzzling (B0)       Accuracy (7 emotion): 98.23
