meta-llama/llama

CPU inference for Llama 3.2


Hello. I need CPU inference for Llama 3.2, and I found the open PR #202 for it. Do you have plans to merge it? I could not find any mention of a CPU mode in main, so I assume this PR is still relevant.