Pinned Repositories
axlearn
An Extensible Deep Learning Library
recurrent_drafter
SHARK-Turbine
Unified compiler/runtime for interfacing with PyTorch Dynamo.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
matplotlib
matplotlib: plotting with Python
explainerauthors's Repositories
explainerauthors/recurrent_drafter
explainerauthors/SHARK-Turbine
Unified compiler/runtime for interfacing with PyTorch Dynamo.
explainerauthors/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs