cnbeining's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
deep-diver/LLM-As-Chatbot
LLM as a Chatbot Service
stochasticai/xTuring
Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
imaurer/awesome-llm-json
Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
tloen/llama-int8
Quantized inference code for LLaMA models
aiwaves-cn/RecurrentGPT
Official Code for Paper: RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text
replit/ReplitLM
Inference code and configs for the ReplitLM model family
msoedov/langcorn
⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops
sail-sg/lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Vahe1994/SpQR
bigcode-project/starcoder.cpp
C++ implementation for 💫StarCoder
pointnetwork/point-alpaca
abacaj/replit-3B-inference
Run inference on replit-3B code instruct model using CPU
CrazyCreativeDream/Real-Coze-API
高性能的真·Coze API
zsc/llama_infer
Inference script for Meta's LLaMA models using Hugging Face wrapper
rmihaylov/mpttune
Tune MPTs
Omico/Gradm
Gradm (Gradle dependencies manager)