312shan

shenzhen

312shan's Stars

AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python141k 1.1k 7.6k26.6k
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Language:Go91.9k 543 4.5k7.2k
open-webui/open-webui
User-friendly WebUI for AI (Formerly Ollama WebUI)
Language:Svelte41.2k 203 2.3k4.9k
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
Language:Python39.8k 327 3.6k5.2k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python35k 342 2.7k4.1k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.8k 228 4.7k4.1k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python26.4k 219 2443k
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Language:Shell25.3k 308 2583.2k
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. Actively in public development.
Language:TypeScript17.9k 117 1.5k3k
meta-llama/codellama
Inference code for CodeLlama models
Language:Python15.9k 184 1951.9k
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python13.6k 101 1k1.1k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python9.6k 74 1.1k1.2k
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language:Python8.9k 99 1.3k1k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language:C++8.3k 89 1.8k932
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
Language:Python7.7k 49 650844
ymcui/Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
Language:Python7.1k 78 388578
deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Language:Python6.6k 68 158460
pliang279/awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
5.9k 178 15849
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Language:Python4.9k 49 441373
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Language:Python4.4k 31 455468
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Language:MDX3.8k 83 300589
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Language:Python2.7k 19 758245
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python2.4k 24 171184
xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Language:Python1.8k 17 110134
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Language:Python1.7k 16 393202
huybery/Awesome-Code-LLM
👨‍💻 An awesome and curated list of best code-LLM for research.
885 31 949
defog-ai/sql-eval
Evaluate the accuracy of LLM generated outputs
Language:Jupyter Notebook539 9 1856
PKU-YuanGroup/Machine-Mindset
An MBTI Exploration of Large Language Models
Language:Python456 7 221
wandb/server
W&B Server is the self hosted version of Weights & Biases
Language:HCL253 13 11321
xv44586/Chinese-instruction-datasets
中文 Instruction tuning datasets
114 2 06

312shan

312shan's Stars

AUTOMATIC1111/stable-diffusion-webui

ollama/ollama

open-webui/open-webui

oobabooga/text-generation-webui

microsoft/DeepSpeed

vllm-project/vllm

meta-llama/llama3

OpenBMB/ChatDev

danny-avila/LibreChat

meta-llama/codellama

QwenLM/Qwen

huggingface/trl

huggingface/text-generation-inference

NVIDIA/TensorRT-LLM

axolotl-ai-cloud/axolotl

ymcui/Chinese-LLaMA-Alpaca-2

deepseek-ai/DeepSeek-Coder

pliang279/awesome-multimodal-ml

QwenLM/Qwen-VL

AutoGPTQ/AutoGPTQ

huggingface/deep-rl-class

modelscope/swift

mit-han-lab/llm-awq

xlang-ai/instructor-embedding

casper-hansen/AutoAWQ

huybery/Awesome-Code-LLM

defog-ai/sql-eval

PKU-YuanGroup/Machine-Mindset

wandb/server

xv44586/Chinese-instruction-datasets