siyan-zhao/prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"
Jupyter Notebook
Issues
- 3
AMAZING WORK! 4d mask support.
#1 opened by aldopareja
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"
Jupyter Notebook