312shan's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
open-webui/open-webui
User-friendly WebUI for AI (Formerly Ollama WebUI)
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
meta-llama/llama3
The official Meta Llama 3 GitHub site
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. Actively in public development.
meta-llama/codellama
Inference code for CodeLlama models
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
huggingface/trl
Train transformer language models with reinforcement learning.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
ymcui/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
huybery/Awesome-Code-LLM
👨‍💻 An awesome, curated list of the best code LLMs for research.
defog-ai/sql-eval
Evaluate the accuracy of LLM-generated outputs
PKU-YuanGroup/Machine-Mindset
An MBTI Exploration of Large Language Models
wandb/server
W&B Server is the self hosted version of Weights & Biases
xv44586/Chinese-instruction-datasets
Chinese instruction-tuning datasets