Pinned Repositories
FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
stable-diffusion-webui
Stable Diffusion web UI
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
LHQUer's Repositories
LHQUer/stable-diffusion-webui
Stable Diffusion web UI