🧑‍🔬 Tabby Registry

Completion models (`--model`)

We recommend using

For 1B to 3B models, it's advisable to have at least NVIDIA T4, 10 Series, or 20 Series GPUs, or Apple Silicon like the M1.
For 7B to 13B models, we recommend using NVIDIA V100, A100, 30 Series, or 40 Series GPUs.

We have published benchmarks for these models on https://leaderboard.tabbyml.com for Tabby's users to consider when making trade-offs between quality, licensing, and model size.

Model ID	License
TabbyML/StarCoder-1B	BigCode-OpenRAIL-M
TabbyML/StarCoder-3B	BigCode-OpenRAIL-M
TabbyML/StarCoder-7B	BigCode-OpenRAIL-M
TabbyML/StarCoder2-3B	BigCode-OpenRAIL-M
TabbyML/StarCoder2-7B	BigCode-OpenRAIL-M
TabbyML/CodeLlama-7B	Llama 2
TabbyML/CodeLlama-13B	Llama 2
TabbyML/DeepseekCoder-1.3B	Deepseek License
TabbyML/DeepseekCoder-6.7B	Deepseek License
TabbyML/CodeGemma-2B	Gemma License
TabbyML/CodeGemma-7B	Gemma License
TabbyML/CodeQwen-7B	Tongyi Qianwen License
TabbyML/Codestral-22B	Mistral AI Non-Production License
TabbyML/DeepSeek-Coder-V2-Lite	Deepseek License

Chat models (`--chat-model`)

To ensure optimal response quality, and given that latency requirements are not stringent in this scenario, we recommend using a model with at least 1B parameters.

Model ID	License
TabbyML/Mistral-7B	Apache 2.0
TabbyML/CodeGemma-7B-Instruct	Gemma License
TabbyML/Qwen2-1.5B-Instruct	Apache 2.0
TabbyML/CodeQwen-7B-Chat	Tongyi Qianwen License
TabbyML/Codestral-22B	Mistral AI Non-Production License

Galtvam/registry-tabby

🧑‍🔬 Tabby Registry

Completion models (--model)

Chat models (--chat-model)

Completion models (`--model`)

Chat models (`--chat-model`)