Command Palette
Search for a command to run...
MSc-SQL: Multi-Sample Critiquing Small Language Models For Text-To-SQL Translation
Gorti Satya Krishna ; Gofman Ilan ; Liu Zhaoyan ; Wu Jiapeng ; Vouitsis Noël ; Yu Guangwei ; Cresswell Jesse C. ; Hosseinzadeh Rasa

Abstract
Text-to-SQL generation enables non-experts to interact with databases vianatural language. Recent advances rely on large closed-source models like GPT-4that present challenges in accessibility, privacy, and latency. To addressthese issues, we focus on developing small, efficient, and open-sourcetext-to-SQL models. We demonstrate the benefits of sampling multiple candidateSQL generations and propose our method, MSc-SQL, to critique them usingassociated metadata. Our sample critiquing model evaluates multiple outputssimultaneously, achieving state-of-the-art performance compared to otheropen-source models while remaining competitive with larger models at a muchlower cost. Full code can be found at https://github.com/layer6ai-labs/msc-sql.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| text-to-sql-on-bird-big-bench-for-large-scale | MSc-SQL | Execution Accuracy % (Dev): 65.6 |
| text-to-sql-on-spider | MSc-SQL | Execution Accuracy (Test): 84.7 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.