Multimodal Intent Recognition On Photochat
评估指标
F1
Precision
Recall
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||||
|---|---|---|---|---|---|
| PaCE | 63.8 | 63.3 | 68 | PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | |
| T5-3B | 58.9 | 54.1 | 64.6 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | |
| T5-base | 58.1 | 58.2 | 57.9 | Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | |
| BERT | 53.2 | 56.1 | 50.6 | BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | |
| ViLT | 52.4 | 55.4 | 58.9 | ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | |
| ALBERT-base | 52.2 | 44.8 | 62.7 | ALBERT: A Lite BERT for Self-supervised Learning of Language Representations |
0 of 6 row(s) selected.