graphcore-research/llm-inference-research
An experimentation platform for LLM inference optimisation
Jupyter NotebookMIT
Issues
- 2
- 1
End-to-end Inference Time Benchmarking
#2 opened by prajwal1210 - 1
Plans to release Triton SparQ?
#1 opened by liyucheng09