frankxyy opened this issue 2 years ago · 1 comments
Hi, I am wondering if llama inference now is available, thank you.
Any updates?
Is the llama2 inference is available too?
If the llama2 inference is available now, may I have a sample code which show to use lightseq to accelerate the llama2 inference?