libraryofcelsus/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
PythonMIT
Watchers
No one’s watching this repository yet.
A fast inference library for running LLMs locally on modern consumer-class GPUs
PythonMIT
No one’s watching this repository yet.