Dense Captioning On Visual Genome
评估指标
mAP
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| ControlCap | 18.2 | ControlCap: Controllable Region-level Captioning | |
| GRiT (ViT-B) | 15.5 | GRiT: A Generative Region-to-text Transformer for Object Understanding | |
| CAG-Net | 10.5 | Context and Attribute Grounded Dense Captioning | - |
| FCLN | 5.4 | DenseCap: Fully Convolutional Localization Networks for Dense Captioning |
0 of 4 row(s) selected.