juncongmoo/pyllama

How to run the 13B model on a single GPU using only inference.py?

statyui opened this issue · 0 comments

How can I run the 13B model on a single GPU using only inference.py?