wuchangping's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
huggingface/candle
Minimalist ML framework for Rust
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
triton-lang/triton
Development repository for the Triton language and compiler
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
guoyww/AnimateDiff
Official implementation of AnimateDiff.
arogozhnikov/einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
kohya-ss/sd-scripts
TencentARC/T2I-Adapter
T2I-Adapter
ztxz16/fastllm
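A pure C++ cross-platform LLM acceleration library, callable from Python; ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models; runs smoothly on mobile devices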
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
SeldonIO/alibi
Algorithms for explaining machine learning models
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
BBuf/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
OpenLMLab/MOSS-RLHF
MOSS-RLHF
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
MegEngine/InferLLM
A lightweight LLM inference framework
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
facebookresearch/dadaptation
D-Adaptation for SGD, Adam and AdaGrad
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
Ascend/pytorch
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
KohakuBlueleaf/HyperKohaku
A diffusers based implementation of HyperDreamBooth