Chart Question Answering On Chartqa

评估指标

1:1 Accuracy

评测结果

各个模型在此基准测试上的表现结果

Paper TitleRepository
ChartPaLI-5B + PaLM 2-S81.3Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs-
Gemini Ultra80.8Gemini: A Family of Highly Capable Multimodal Models
DePlot+FlanPaLM+Codex (PoT Self-Consistency)79.3DePlot: One-shot visual language reasoning by plot-to-table translation
ChartPaLI-5B77.3Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs-
ScreenAI 5B (4.62 B params, w/ OCR)76.7ScreenAI: A Vision-Language Model for UI and Infographics Understanding
DePlot+Codex (PoT Self-Consistency)76.7DePlot: One-shot visual language reasoning by plot-to-table translation
SMoLA-PaLI-X Specialist Model74.6Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts-
SMoLA-PaLI-X Generalist Model73.8Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts-
MatCha4096 + LaMenDa72.64Synthesize Step-by-Step: Tools Templates and LLMs as Data Generators for Reasoning-Based Chart VQA-
PaLI-X (Single-task FT w/ OCR)72.3PaLI-X: On Scaling up a Multilingual Vision and Language Model
PaLI-X (Single-task FT)70.9PaLI-X: On Scaling up a Multilingual Vision and Language Model
PaLI-X (Multi-task FT)70.6PaLI-X: On Scaling up a Multilingual Vision and Language Model
DePlot+FlanPaLM (Self-Consistency)70.5DePlot: One-shot visual language reasoning by plot-to-table translation
PaLI-370PaLI-3 Vision Language Models: Smaller, Faster, Stronger
PaLI-3 (w/ OCR)69.5PaLI-3 Vision Language Models: Smaller, Faster, Stronger
DePlot+FlanPaLM (CoT)67.3DePlot: One-shot visual language reasoning by plot-to-table translation
Qwen-VL-Chat66.3Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
UniChart66.24UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning
Qwen-VL65.7Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
StructChart+GPT3.5 (STR ChartQA+SimChart9K)65.3StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding
0 of 27 row(s) selected.
Chart Question Answering On Chartqa | SOTA | HyperAI超神经