firejq

@Tencent, Shenzhen

firejq's Stars

meta-llama/llama
Inference code for Llama models
Language: Python 55.7k 519 9619.5k
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
Language: Python 39.8k 327 3.6k 5.2k
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Language: Python 38.6k 445 3065k
LC044/WeChatMsg
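Extract WeChat chat history and export it to HTML, Word, or Excel documents for permanent storage; analyze the chat history to generate an annual chat report; use the chat data to train a personal AI chat assistant
Language: Python 33.5k 171 4033.5k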
lllyasviel/ControlNet
Let us control diffusion models!
Language: Python 29.9k 217 5432.7k
Stability-AI/generative-models
Generative Models by Stability AI
Language: Python 24.2k 256 3042.7k
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Language: Python 18.2k 183 7311.9k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python 16k 107 1k 1.6k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language: Python 13.6k 115 1k 1.2k
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
Language: Python 13.4k 98 7771.6k
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
Language: Python 12k 105 3.6k 2.9k
pypa/pip
The Python package installer
Language: Python 9.5k 318 7.3k 3k
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language: Python 8.8k 98 1.3k 1k
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ 8.3k 88 1.8k 927
ymcui/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models
Language: Python 7.1k 78 388577
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language: Python 6.1k 45 80537
facebookincubator/AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Language: Python 4.5k 82 244363
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
Language: Python 4.4k 30 454466
ztxz16/fastllm
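A pure C++, cross-platform LLM acceleration library with Python bindings; ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models; runs smoothly on mobile devices
Language: C++ 3.3k 41 362334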
libarchive/libarchive
Multi-format archive and compression library
Language: C 3k 110 1.4k 767
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language: Python 2.3k 23 179191
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language: Jupyter Notebook 2.2k 33 87153
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Language: Python 1.2k 21 87138
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
1.1k 12 422
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Language: Python 976 15 3847
triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
Language: Python 664 23 46896
ymcui/Chinese-Mixtral
Chinese Mixtral mixture-of-experts (MoE) LLMs
Language: Python 579 15 1043
git-cloner/aliendao
huggingface mirror download
Language: Python 549 4 2355
OpenPPL/ppl.nn.llm
140 4 618
sarugaku/resolvelib
Resolve abstract dependencies into concrete ones
Language: Python 139 13 6431
