YuriWerewolf/FlexGen
Throughput-oriented systems for large language models on commodity GPUs.
Python · Apache-2.0
No issues in this repository yet.