Question Answering On Convfinqa
评估指标
Execution Accuracy
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| GPT-4 (8k) | 76.48 | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | - |
| FinQANet (RoBERTa-large) | 68.9 | ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering | |
| General Crowd | 46.90 | Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks | - |
0 of 3 row(s) selected.