LinkZyy

UCASBeiJing

LinkZyy's Stars

langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language:Jupyter Notebook94.1k 688 7.8k15.2k
d2l-ai/d2l-zh
《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
Language:Python63.1k 1.1k 011k
xai-org/grok-1
Grok open release
Language:Python49.5k 562 2098.3k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language:Python22.1k 186 4902.2k
joonspk-research/generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
17.3k 140 1262.2k
triton-lang/triton
Development repository for the Triton language and compiler
Language:C++13.3k 194 1.5k1.6k
academicpages/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript12.3k 92 36643.5k
cpacker/MemGPT
Letta (fka MemGPT) is a framework for creating stateful LLM services.
Language:Python12k 115 7781.3k
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language:Python10.3k 157 64811
libsdl-org/SDL
Simple Directmedia Layer
Language:C9.8k 114 7.8k1.8k
cumulo-autumn/StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Language:Python9.7k 78 118691
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.2k 44 80555
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python5.8k 57 593466
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Language:Python2.5k 24 177191
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook2.3k 32 89154
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Language:Python1.7k 24 3994
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Language:Python1.7k 15 401207
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Language:C++1.7k 32 659225
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
Language:Cuda1.4k 16 114124
punica-ai/punica
Serving multiple LoRA finetuned LLM as one
Language:Python977 12 3946
feifeibear/LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
Language:Python552 2 1656
AmadeusChan/Awesome-LLM-System-Papers
492 16 122
AI21Labs/in-context-ralm
Language:Python261 6 1225
microsoft/vattention
Dynamic Memory Management for Serving LLMs without PagedAttention
Language:C215 2 914
geohot/cuda_ioctl_sniffer
Sniff CUDA ioctls
Language:C177 6 524
ACL2023-Retrieval-LM/ACL2023-Retrieval-LM.github.io
https://acl2023-retrieval-lm.github.io/
Language:JavaScript153 5 113
nightdessert/Retrieval_Head
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
Language:Python150 2 813
princeton-nlp/MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Language:Jupyter Notebook101 5 127
UofT-EcoSystem/DietCode
DietCode Code Release
Language:Cuda61 10 09
summerspringwei/souffle-ae
Language:Jupyter Notebook9 1 01

LinkZyy

LinkZyy's Stars

langchain-ai/langchain

d2l-ai/d2l-zh

xai-org/grok-1

hpcaitech/Open-Sora

joonspk-research/generative_agents

triton-lang/triton

academicpages/academicpages.github.io

cpacker/MemGPT

RUCAIBox/LLMSurvey

libsdl-org/SDL

cumulo-autumn/StreamDiffusion

facebookresearch/DiT

sgl-project/sglang

mit-han-lab/llm-awq

FasterDecoding/Medusa

S-LoRA/S-LoRA

casper-hansen/AutoAWQ

flexflow/FlexFlow

flashinfer-ai/flashinfer

punica-ai/punica

feifeibear/LLMSpeculativeSampling

AmadeusChan/Awesome-LLM-System-Papers

AI21Labs/in-context-ralm

microsoft/vattention

geohot/cuda_ioctl_sniffer

ACL2023-Retrieval-LM/ACL2023-Retrieval-LM.github.io

nightdessert/Retrieval_Head

princeton-nlp/MQuAKE

UofT-EcoSystem/DietCode

summerspringwei/souffle-ae