ScalingIntelligence/hydragen
Hydragen: High-Throughput LLM Inference with Shared Prefixes
Python · Apache-2.0
Stargazers
- Adriatogi
- anguyenbus
- BradleyBrown19 (Oxford, UK)
- GarrickLin (HangZhou)
- interestingLSY (Peking University)
- jason-huang03 (Tsinghua University, NVIDIA)
- jonzarecki (Tel Aviv, Israel)
- jpwoeltjen
- jssonx (Rice University)
- lambda7xx (Shanghai Jiao Tong University)
- learning-chip
- manatk
- mayeechen
- Mohamad-Hussein (Calgary)
- noprompt (Friant, CA)
- quic-sanising (Qualcomm Technologies, Inc.)
- shreyansh26 (Level AI)
- simonguozirui (PhD Student at Stanford)
- slhuang
- Stubborn-one
- TGLTommy
- thomasbtnfr (CNRS)
- vsevolodl (Gromozeka LLC)
- wellhowtosay
- XinYao1994 (HUST & HKU)