HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL

Wu Lixia ; Li Peng ; Lou Junhong ; Fu Lei

DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL

Abstract

In addressing the pivotal role of translating natural language queries intoSQL commands, we propose a suite of compact, fine-tuned models and self-refinemechanisms to democratize data access and analysis for non-expert users,mitigating risks associated with closed-source Large Language Models.Specifically, we constructed a dataset of over 20K sample for Text-to-SQL aswell as the preference dateset, to improve the efficiency in the domain of SQLgeneration. To further ensure code validity, a code corrector was integratedinto the model. Our system, DataGpt-sql, achieved 87.2\% accuracy on thespider-dev, respectively, showcasing the effectiveness of our solution intext-to-SQL conversion tasks. Our code, data, and models are available at\url{https://github.com/CainiaoTechAi/datagpt-sql-7b}

Benchmarks

BenchmarkMethodologyMetrics
text-to-sql-on-spiderdatagpt-sql-7B + InvalidSQL-Feedback
Exact Match Accuracy (Dev): 81.6
Execution Accuracy (Dev): 87.2
text-to-sql-on-spiderdatagpt-sql-7B
Exact Match Accuracy (Dev): 80.3
Execution Accuracy (Dev): 84.8

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL | Papers | HyperAI