Pinned Repositories
H2O
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
SpecGame
Code for paper titled "How Speculative Can Speculative Decoding Be?".
ZhuoruiLiu12's Repositories
ZhuoruiLiu12/SpecGame
Code for paper titled "How Speculative Can Speculative Decoding Be?".
ZhuoruiLiu12/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!