Command Palette
Search for a command to run...
Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection
Jingbiao Mei Jinghong Chen Guangyu Yang Weizhe Lin Bill Byrne

Abstract
Hateful memes have become a significant concern on the Internet, necessitating robust automated detection systems. While LMMs have shown promise in hateful meme detection, they face notable challenges like sub-optimal performance and limited out-of-domain generalization capabilities. Recent studies further reveal the limitations of both SFT and in-context learning when applied to LMMs in this setting. To address these issues, we propose a robust adaptation framework for hateful meme detection that enhances in-domain accuracy and cross-domain generalization while preserving the general vision-language capabilities of LMMs. Experiments on six meme classification datasets show that our approach achieves state-of-the-art performance, outperforming larger agentic systems. Moreover, our method generates higher-quality rationales for explaining hateful content compared to standard SFT, enhancing model interpretability.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| hateful-meme-classification-on-harm-p | LMM-RGCL (Qwen2-VL-7B) | Accuracy: 91.6 F1: 91.1 |
| hateful-meme-classification-on-harmeme | LMM-RGCL (Qwen2VL-7B) | AUROC: 93.2 Accuracy: 88.1 |
| hateful-meme-classification-on-harmeme | LMM-RGCL (Qwen2VL-2B) | AUROC: 92.9 Accuracy: 87.7 |
| hateful-meme-classification-on-hateful-memes-1 | LMM-RGCL (Qwen2-VL-7B) | AUROC: 91.1 |
| hateful-meme-classification-on-pridemm | LMM-RGCL (Qwen2-VL-2B) | Accuracy: 76.0 F1: 76.7 |
| hateful-meme-classification-on-pridemm | LMM-RGCL (Qwen2-VL-7B) | Accuracy: 78.1 F1: 78.4 |
| meme-classification-on-hateful-memes | LMM-RGCL (Qwen2-VL-7B) | Accuracy: 0.821 ROC-AUC: 0.911 |
| meme-classification-on-hateful-memes | LMM-RGCL (LLaVA-1.5-7B) | Accuracy: 0.809 ROC-AUC: 0.897 |
| meme-classification-on-hateful-memes | LMM-RGCL (Qwen2-VL-2B) | Accuracy: 0.791 ROC-AUC: 0.884 |
| meme-classification-on-multioff | LMM-RGCL (Qwen2-VL-7B) | Accuracy: 71.1 F1: 64.8 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.