CPU Inference of LLama 3.2
Opened this issue · 0 comments
FedorSymkin commented
Hello. I need CPU Inference of LLama 3.2, and I've found opened PR #202 for it. Do you have plans to merge it? I could not find any mentions about CPU mode in main, so I assume this PR is still actual