Text To Sql On Spider 2 0
评估指标
Success Rate
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| Spider-Agent + o1-preview | 17.03 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + GPT-4o | 10.13 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + Claude-3.5-Sonnect | 9.02 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + GPT-4 | 8.86 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + Qwen2.5-72B | 6.17 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + DeepSeek-V2.5 | 5.22 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + Gemini-Pro-1.5 | 2.53 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
| Spider-Agent + Llama-3.1-405B | 2.21 | Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows | - |
0 of 8 row(s) selected.