Pinned Repositories
Awesome-Video-Attention
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and caching, etc.
Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
cse234-w25
Website for CSE 234, Winter 2025
cse234-w25-PA
dsc204a-w24
Website for DSC 204a, Winter 2024
Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
FastVideo
A unified inference and post-training framework for accelerated video generation.
LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
MuxServe
vllm-ltr
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
Hao AI Lab's Repositories
hao-ai-lab/FastVideo
A unified inference and post-training framework for accelerated video generation.
hao-ai-lab/LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
hao-ai-lab/Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
hao-ai-lab/MuxServe
hao-ai-lab/vllm-ltr
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
hao-ai-lab/LookaheadReasoning
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
hao-ai-lab/Awesome-Video-Attention
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and caching, etc.
hao-ai-lab/cse234-w25-PA
hao-ai-lab/cse234-w25
Website for CSE 234, Winter 2025
hao-ai-lab/dsc204a-w24
Website for DSC 204a, Winter 2024
hao-ai-lab/dsc291-PA
hao-ai-lab/hao-ai-lab.github.io
hao-ai-lab/dynamo
A Datacenter Scale Distributed Inference Serving Framework
hao-ai-lab/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
hao-ai-lab/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
hao-ai-lab/dsc291-s24
Website for DSC 291, Spring 2024
hao-ai-lab/env-setup
General repository to setup environments
hao-ai-lab/llmutils
LLM Utils
hao-ai-lab/sglang
SGLang is a fast serving framework for large language models and vision language models.
hao-ai-lab/dsc204a-f25
Website for DSC 204A, Fall 2025