iteratorlee's Stars
ggerganov/llama.cpp
LLM inference in C/C++
binary-husky/gpt_academic
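Provides a practical interactive interface for GPT, GLM, and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; project analysis and self-interpretation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.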
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
typst/typst
A new markup-based typesetting system that is powerful and easy to learn.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
waydabber/BetterDisplay
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Models
LazyVim/LazyVim
Neovim config for the lazy
Caldis/Mos
A lightweight tool for smoothing mouse scrolling and setting the scroll direction independently on macOS, making your scroll wheel feel as smooth as a trackpad
stas00/ml-engineering
Machine Learning Engineering Open Book
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
MLNLP-World/Paper-Writing-Tips
Paper Writing Tips: a repository compiled by the MLNLP community to help authors avoid small mistakes in paper submissions.
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
tensorchord/envd
🏕️ Reproducible development environment
laekov/fastmoe
A fast MoE impl for PyTorch
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
punica-ai/punica
Serving multiple LoRA finetuned LLMs as one
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
hpcaitech/EnergonAI
Large-scale model inference.
epfml/landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
FMInference/H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
FMInference/DejaVu
symisc/tiny-dream
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
FlagOpen/FlagAttention
A collection of memory efficient attention operators implemented in the Triton language.
alpa-projects/mms
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)
UofT-EcoSystem/hotline
pingzhili/light-fairseq
Conversions of Fairseq models in HuggingFace-style