| DePlot+FlanPaLM+Codex (PoT Self-Consistency) | 79.3 | DePlot: One-shot visual language reasoning by plot-to-table translation | |
| ScreenAI 5B (4.62 B params, w/ OCR) | 76.7 | ScreenAI: A Vision-Language Model for UI and Infographics Understanding | |
| DePlot+Codex (PoT Self-Consistency) | 76.7 | DePlot: One-shot visual language reasoning by plot-to-table translation | |
| PaLI-X (Single-task FT w/ OCR) | 72.3 | PaLI-X: On Scaling up a Multilingual Vision and Language Model | |
| DePlot+FlanPaLM (Self-Consistency) | 70.5 | DePlot: One-shot visual language reasoning by plot-to-table translation | |
| StructChart+GPT3.5 (STR ChartQA+SimChart9K) | 65.3 | StructChart: On the Schema, Metric, and Augmentation for Visual Chart
Understanding | |