/are-copilots-local-yet

Are Copilots Local Yet? Explore the frontier of local LLM Copilots for code completion, project generation, shell assistance, and more. Uncover the tools and trends shaping tomorrow's developer experience today!

Primary LanguagePythonMIT LicenseMIT

🛠️ Are Copilots Local Yet?

Current trends and state of the art for using open & local LLM models as copilots to complete code, generate projects, act as shell assistants, automatically fix bugs, and more.

📝 Help keep this list relevant and up-to-date by making edits!

📋 Summary

Local Copilots are in an early experimental stage, with most being of MVP-quality.

The reasons for this are:

  • 📉 Local models still being inferior to Copilot
  • 🔧 Difficult to set up
  • 💻 High hardware requirements

However, as models improve, and editor extensions get developed to use them, we're expected to get a renaissance of code-completion tools.

This document is a curated list of local Copilots, shell assistants, and related projects. It is intended to be a resource for those interested in a survey of the existing tools, and to help developers discover the state of the art for projects like these.

📚 Background

In 2021, GitHub released Copilot which quickly became popular among devs. Since then, with the flurry of AI developments around LLMs, local models that can run on consumer machines have become available, and it has seemed only a matter of time before Copilot will go local.

Many perceived limitations of GitHub's Copilot are related to its closed and cloud-hosted nature.

As an alternative, local Copilots enable:

  • 🌐 Offline & private use
  • ⚡ Improved responsiveness
  • 📚 Better project/context awareness
  • 🎯 The ability to run models specialized for a particular language/task
  • 🔒 Constraining the LLM output to fit a particular format/syntax.

🧩 Extensions

Editor extensions used to complete code using LLMs:

Name Editor Released Notes
GitHub Copilot VSCode, vim 6262 2021-6-29 The GitHub Original, not local or open-source.
Cursor VSCode 17950 2023-3-14 Fork of VSCode, not open-source
Fauxpilot VSCode 13267 2022-9-3 Early local PoC. Stale?
Tabby VSCode 11242 2023-9-30 Completes the cursor selection
turbopilot VSCode 3840 2023-4-10 Completions with FIM support, inspired by fauxpilot
HuggingFace-vscode VSCode 785 2023-6-19 Fork of Tabnine, supports Starcoder
localpilot VSCode 1275 2023-10-2 Utility for easily hosting models locally, for use with official Copilot extension using custom API endpoint.
StarcoderEx VSCode 79 2023-5-5 Completes the cursor selection
WizardCoder-VSC VSCode 109 2023-6-19 PoC, article available
KoboldAIConnect VSCode 2023-10-7 Copilot clone using local KoboldAI backend
gen.nvim vim 95 2023-10-1 Edit selection using custom prompts
uniteai VSCode, emacs, lsp 122 2023-8-27

🛠️ Tools

Tools that try to generate projects/features from specification:

Name Released Notes
gpt-engineer 44619 2023-6-6 Specify what you want it to build, the AI asks for clarification, and then builds it.
gpt-pilot 11815 2023-7-18 Very similar to gpt-engineer
continue 4428 2023-5-24 VSCode extension. Task-based autocomplete
aider 4146 2023-6-8 AI pair programming in your terminal, works well with pre-existing, larger codebases
rift 2719 2023-6-20 VSCode extension. Lets you write code by chatting, makes your IDE agentic, AI engineer that works alongside you.
mentat 1667 2023-7-25 Mentat coordinates edits across multiple locations and files.
clippinator 250 2023-4-15 Uses a team of agents to plan, write, debug, and test
Refact.AI 1008 2023-10-06 Full self-hostable code completion, chat and training service, complete with VSCode extension.

🗨️ Chat Interfaces

Chat interfaces with shell/REPL/notebook access. Similar to/inspired by ChatGPT's "Advanced Data Analysis" feature (previously "Code Interpreter").

Name Notes
open-interpreter 29191 open-source, locally running implementation of OpenAI's Code Interpreter
gptme 94 Supporting open models. Developed by me, @ErikBjare
octogen 169 Local Code Interpreter executing in Docker environment.
terminal-x 30 Very early prototype that converts natural language into shell commands, unmaintained since Sept. 2021

🤖 Models

Models relevant for local Copilot-use. Ordered by most recent first.

Name Size Languages Released Notes
Phind CodeLlama v2 34B Many 359 2023-8-27
WizardCoder-Python 7/13/34B Python 614 2023-8
CodeLlama 7/13/34B Many 10414 2023-8
WizardCoder 15B 80+ 621 2023-6 Fine-tuning of Starcoder
replit-glaive 3B 1? 84 2023-7 Small model fine-tuned on high-quality data with impressive performance.
Starcoder 15B 80+ 6441 2023-5
replit-v1-3b 3B 20+ 692 2023-5
SantaCoder 1.1B Python, Java, JavaScript 300 2023-4 Tiny model selectively trained on 3 languages from 'The Stack'

Note: due to the pace of new model releases, this section is doomed to be out of date.

📚 Datasets

Datasets relevant for training models.

Name Size Languages Released Notes
The Stack 3TB/6TB 358 >500 2022-10 Excludes weak-copyleft licenses (MPL, LGPL, EGL) since v1.1

Tools

Misc relevant useful tools.

Name Released Notes
ollama 9526 2023-8-27 Easily get up and running with large language models locally.

📰 Misc

📈 Stats

Stargazers over time:

Stargazers over time