How to run the 13B model on a single GPU using just inference.py?

Question

statyui opened this issue 2 years ago · 0 comments