text-to-sql
There are 145 repositories under the text-to-sql topic.
sinaptik-ai/pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Canner/WrenAI
⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered insights in seconds.
sqlchat/sqlchat
Chat-based SQL Client and Editor for the next decade
Dataherald/dataherald
Interact with your SQL database, Natural Language to SQL using LLMs
eosphoros-ai/Awesome-Text2SQL
Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.
eosphoros-ai/DB-GPT-Hub
A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL
HKUSTDial/NL2SQL_Handbook
[TKDE'25] A continuously updated handbook that helps readers track the latest Text-to-SQL techniques in the literature and provides practical guidance for researchers and practitioners. Official repo for "A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?"
Mindinventory/MindSQL
MindSQL: A Python Text-to-SQL RAG Library simplifying database interactions. Seamlessly integrates with PostgreSQL, MySQL, SQLite, Snowflake, and BigQuery. Powered by GPT-4 and Llama 2, it enables natural language queries. Supports ChromaDB and Faiss for context-aware responses.
premAI-io/premsql
End-to-End Local-First Text-to-SQL Pipelines
microsoft/ContextualSP
Open-source code for multiple papers from the Microsoft Research Asia DKI group
RUCKBReasoning/OmniSQL
[VLDB'25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.
salesforce/TabularSemanticParsing
Translating natural language questions to structured query language (SQL)
totalhack/zillion
Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/
deepsense-ai/db-ally
Efficient, consistent and secure library for querying structured data with natural language
X-LANCE/text2sql-lgesql
[ACL 2021] Source code and pre-trained models for the ACL 2021 long paper "LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations".
aws-samples/generative-bi-using-rag
Solution guidance for Generative BI using Amazon Bedrock and Amazon OpenSearch with RAG
Laqcce-cao/RSL-SQL
RSL-SQL: Robust Schema Linking in Text-to-SQL Generation
HKUSTDial/NL2SQL360
🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”
aws-samples/text-to-sql-bedrock-workshop
This repository is intended for those looking to dive deep into advanced Text-to-SQL concepts.
tshu-w/DBCopilot
Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)
glee4810/EHRSQL
[NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
Chirayu-Tripathi/nl2query
A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.
wshi83/EhrAgent
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
aws-samples/build-an-agentic-llm-assistant
Labs for the "Build an agentic LLM assistant on AWS" workshop. A step-by-step agentic LLM assistant development workshop using a serverless three-tier architecture.
lyuqin/HydraNet-WikiSQL
Code and trained model for Hybrid ranking network for text-to-SQL on WikiSQL
rocket-connect/gqlpt
Leverage AI to generate GraphQL queries from plain text.
OSU-NLP-Group/llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
XGenerationLab/xiyan_mcp_server
A Model Context Protocol (MCP) server that enables natural language queries to databases
dermatologist/pyomop
Python package for managing OHDSI clinical data models. Includes support for LLM-based plain-text queries!
databrainhq/dataneuron
Data Neuron is a powerful framework that enables you to build text-to-SQL applications with an easily maintainable semantic layer. Whether you're creating customer-facing chatbots, internal Slack bots for analytics, or other data-driven applications, Data Neuron provides the tools to make your data accessible through natural language.
MSR-LIT/Splash
Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing
Chia-Hsuan-Lee/KaggleDBQA
Introduction page of a challenging text-to-SQL dataset: KaggleDBQA
yandex-research/sparqling-queries
This repo is the implementation of the EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina and Anton Osokin
BrettlyCD/text-to-sql
An application to write and run SQL queries, returning answers to natural language questions, using langchain and open source LLM models through HuggingFace.
yszh8/TriageSQL
The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question Intention Classification Benchmark for Text-to-SQL"