text-to-sql

There are 145 repositories under text-to-sql topic.

  • sinaptik-ai/pandas-ai

    Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

    Language:Python22.1k1708912.1k
  • vanna

    vanna-ai/vanna

    🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

    Language:Python20.4k1424411.9k
  • WrenAI

    Canner/WrenAI

    ⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered insights in seconds.

    Language:TypeScript11.9k754651.2k
  • sqlchat

    sqlchat/sqlchat

    Chat-based SQL Client and Editor for the next decade

    Language:TypeScript5.4k3279474
  • Dataherald/dataherald

    Interact with your SQL database, Natural Language to SQL using LLMs

    Language:Python3.6k2746258
  • eosphoros-ai/Awesome-Text2SQL

    Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

  • eosphoros-ai/DB-GPT-Hub

    A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

    Language:Python1.9k25170235
  • HKUSTDial/NL2SQL_Handbook

    [TKDE'25] This is a continuously updated handbook for readers to easily track the latest Text-to-SQL techniques in the literature and provide practical guidance for researchers and practitioners. Official repo for A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?

    Language:Python96013163
  • MindSQL

    Mindinventory/MindSQL

    MindSQL: A Python Text-to-SQL RAG Library simplifying database interactions. Seamlessly integrates with PostgreSQL, MySQL, SQLite, Snowflake, and BigQuery. Powered by GPT-4 and Llama 2, it enables natural language queries. Supports ChromaDB and Faiss for context-aware responses.

    Language:Python42392041
  • premAI-io/premsql

    End-to-End Local-First Text-to-SQL Pipelines

    Language:Python39672534
  • microsoft/ContextualSP

    Multiple paper open-source codes of the Microsoft Research Asia DKI group

    Language:Python383153460
  • RUCKBReasoning/OmniSQL

    [VLDB' 25] Synthesizing High-quality Text-to-SQL Data at Scale. SynSQL-2.5M is the first million-scale cross-domain text-to-SQL dataset.

    Language:Python33641838
  • salesforce/TabularSemanticParsing

    Translating natural language questions to structured query language (SQL)

    Language:Jupyter Notebook23094353
  • totalhack/zillion

    Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/

    Language:Python1979176
  • deepsense-ai/db-ally

    Efficient, consistent and secure library for querying structured data with natural language

    Language:Python1604104
  • X-LANCE/text2sql-lgesql

    [ACL 2021] This is the project containing source codes and pre-trained models about ACL2021 Long Paper ``LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations".

    Language:Python15422437
  • aws-samples/generative-bi-using-rag

    A solution guidance for Generative BI using Amazon Bedrock, Amazon OpenSearch with RAG

    Language:Python14951146
  • Laqcce-cao/RSL-SQL

    RSL-SQL: Robust Schema Linking in Text-to-SQL Generation

    Language:Python13911513
  • HKUSTDial/NL2SQL360

    🔥[VLDB'24] Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?”

    Language:Python1311814
  • aws-samples/text-to-sql-bedrock-workshop

    This repository is intended for those looking to dive deep on advanced Text-to-SQL concepts.

    Language:Jupyter Notebook1237326
  • tshu-w/DBCopilot

    Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)

    Language:Python1175918
  • glee4810/EHRSQL

    [NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records

    Language:Python90459
  • Chirayu-Tripathi/nl2query

    A framework for converting natural language text inputs to corresponding Pandas, MongoDB, Kusto and Neo4j (Cypher) queries.

    Language:Python89728
  • wshi83/EhrAgent

    [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records

    Language:Python894712
  • aws-samples/build-an-agentic-llm-assistant

    Labs for the "Build an agentic LLM assistant on AWS" workshop. A step by step agentic llm assistant development workshop using serverless three-tier architecture.

    Language:Jupyter Notebook689129
  • lyuqin/HydraNet-WikiSQL

    Code and trained model for Hybrid ranking network for text-to-SQL on WikiSQL

    Language:Python6821618
  • gqlpt

    rocket-connect/gqlpt

    Leverage AI to generate GraphQL queries from plain text.

    Language:TypeScript63451
  • OSU-NLP-Group/llm-planning-eval

    [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"

    Language:Python54323
  • XGenerationLab/xiyan_mcp_server

    A Model Context Protocol (MCP) server that enables natural language queries to databases

    Language:Python527
  • pyomop

    dermatologist/pyomop

    Python package for managing OHDSI clinical data models. Includes support for LLM based plain text queries!

    Language:Python49348
  • databrainhq/dataneuron

    Data Neuron is a powerful framework that enables you to build text-to-SQL applications with an easily maintainable semantic layer. Whether you're creating customer-facing chatbots, internal Slack bots for analytics, or other data-driven applications, Data Neuron provides the tools to make your data accessible through natural language

    Language:Python45422
  • MSR-LIT/Splash

    Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing

  • Chia-Hsuan-Lee/KaggleDBQA

    Introduction page of a challenging text-to-SQL dataset: KaggleDBQA

  • yandex-research/sparqling-queries

    This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

    Language:Python34417
  • BrettlyCD/text-to-sql

    An application to write and run SQL queries, returning answers to natural language questions, using langchain and open source LLM models through HuggingFace.

    Language:Jupyter Notebook331416
  • yszh8/TriageSQL

    The dataset and source code for our paper: "Did You Ask a Good Question? A Cross-Domain Question IntentionClassification Benchmark for Text-to-SQL"

    Language:Python31114