I would use CodeGen-16B or StarCoder with the Spider dev set for fine-tuning. Data annotation may be needed for fine-tuning (FT).
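A minimal sketch of how Spider-style records could be turned into (prompt, completion) pairs for fine-tuning. The field names (`db_id`, `question`, `query`) follow Spider's JSON layout; the prompt template itself is just an assumption, not a fixed choice:

```python
import json

def spider_to_pairs(examples):
    """Convert Spider-style records into (prompt, completion) pairs.
    Assumes each record has 'db_id', 'question', and 'query' keys."""
    pairs = []
    for ex in examples:
        # Hypothetical prompt format: DB name + question, ending at "SELECT"
        prompt = f"-- Database: {ex['db_id']}\n-- Question: {ex['question']}\nSELECT"
        # Strip the leading SELECT so prompt + completion reads as one query
        completion = ex["query"].removeprefix("SELECT")
        pairs.append({"prompt": prompt, "completion": completion})
    return pairs

if __name__ == "__main__":
    demo = [{"db_id": "concert_singer",
             "question": "How many singers are there?",
             "query": "SELECT count(*) FROM singer"}]
    print(json.dumps(spider_to_pairs(demo), indent=2))
```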
I mostly used the GPT-4 GUI. A proper evaluation would need an API without rate limits, or a local inference server with open-source models:
- LLaMA-65B
- Falcon-40B-Instruct
- BLOOM
- Dolly-v2
- etc.
Evaluation metrics:
- Percentage of predictions that are valid SQL (VA)
- Execution accuracy (EX)
- Component Matching (CM)
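A rough sketch of computing VA and EX. It runs against an in-memory SQLite DB purely to stay self-contained (the notes target ClickHouse); CM is omitted here since it needs a SQL parser, e.g. the official Spider evaluator:

```python
import sqlite3

def valid_sql(conn, sql):
    """VA helper: does the predicted SQL execute at all?"""
    try:
        conn.execute(sql)
        return True
    except sqlite3.Error:
        return False

def execution_match(conn, pred_sql, gold_sql):
    """EX helper: do predicted and gold queries return the same rows?"""
    try:
        pred = conn.execute(pred_sql).fetchall()
    except sqlite3.Error:
        return False
    gold = conn.execute(gold_sql).fetchall()
    # Order-insensitive comparison of result sets
    return sorted(pred) == sorted(gold)

def evaluate(conn, pairs):
    """pairs: list of (predicted_sql, gold_sql). Returns (VA, EX) as fractions."""
    va = sum(valid_sql(conn, p) for p, _ in pairs) / len(pairs)
    ex = sum(execution_match(conn, p, g) for p, g in pairs) / len(pairs)
    return va, ex
```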
TODO:
- Dummy dataset in ClickHouse
- Prompts implementation
- GPT-4 call
- Caching placeholder
- Eval
- Experiments with different LLMs
- Prompt automation for arbitrary DBs
- DB for cached queries, prompts and SQL results
- QDecomp as a separate LLM call
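The caching placeholder plus the cached-queries DB could be sketched as below. `call_llm` is a stand-in for the real GPT-4 (or local-model) request, and SQLite stands in for whatever DB ends up holding the cache; the schema and key scheme are assumptions:

```python
import hashlib
import sqlite3

def make_cache(path=":memory:"):
    """Open (or create) the cache DB for prompts and responses."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS llm_cache "
        "(key TEXT PRIMARY KEY, prompt TEXT, response TEXT)"
    )
    return conn

def cached_call(conn, prompt, model, call_llm):
    """Look up (model, prompt) in the cache; fall back to the real call on a miss."""
    key = hashlib.sha256(f"{model}\x00{prompt}".encode()).hexdigest()
    row = conn.execute(
        "SELECT response FROM llm_cache WHERE key = ?", (key,)
    ).fetchone()
    if row:
        return row[0]
    response = call_llm(prompt, model)  # stand-in for the actual API request
    conn.execute("INSERT INTO llm_cache VALUES (?, ?, ?)", (key, prompt, response))
    conn.commit()
    return response
```

With this in place, repeated evaluation runs over the same prompts hit the cache instead of the rate-limited API.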