Command Palette
Search for a command to run...
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela Hamed Firooz Aravind Mohan Vedanuj Goswami Amanpreet Singh Pratik Ringshia Davide Testuggine

Abstract
This work proposes a new challenge set for multimodal classification, focusing on detecting hate speech in multimodal memes. It is constructed such that unimodal models struggle and only multimodal models can succeed: difficult examples ("benign confounders") are added to the dataset to make it hard to rely on unimodal signals. The task requires subtle reasoning, yet is straightforward to evaluate as a binary classification problem. We provide baseline performance numbers for unimodal models, as well as for multimodal models with various degrees of sophistication. We find that state-of-the-art methods perform poorly compared to humans (64.73% vs. 84.7% accuracy), illustrating the difficulty of the task and highlighting the challenge that this important problem poses to the community.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| meme-classification-on-hateful-memes | Human | Accuracy: 0.847 ROC-AUC: 0.8265 |
| meme-classification-on-hateful-memes | Visual BERT COCO | Accuracy: 0.695 ROC-AUC: 0.754 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.