Add Metal/GPU support for running model inference
singularitti opened this issue · 1 comment
singularitti commented
I'm no expert in this, but inference seems to run on the CPU, which can cause significant heat generation.
alexrozanski commented
@singularitti I'm adding support for this in llama.swift to start with (see alexrozanski/llama.swift#8). This will be coming to LlamaChat v2, which is still a WIP!
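
For anyone following along: in llama.cpp (which llama.swift wraps), GPU/Metal offload is typically controlled by an `n_gpu_layers` parameter at model-load time. Below is a minimal sketch of what enabling it could look like from Swift via llama.cpp's C API — this is not llama.swift's actual API, the `llama` module name is an assumption, and the exact parameter struct varies across llama.cpp versions.

```swift
// Illustrative sketch only — not llama.swift's actual API.
// Assumes a Swift target that imports llama.cpp's C interface
// (module name "llama" is an assumption) and a recent llama.cpp
// where model params carry n_gpu_layers.
import llama

func loadModel(at path: String, useMetal: Bool) -> OpaquePointer? {
    var params = llama_model_default_params()
    // Offload all layers to the GPU; llama.cpp's Metal backend picks
    // this up on Apple Silicon. 0 keeps inference on the CPU.
    params.n_gpu_layers = useMetal ? 999 : 0
    return llama_load_model_from_file(path, params)
}
```

Setting `n_gpu_layers` back to 0 falls back to pure CPU inference, so a wrapper like llama.swift could expose this as a simple toggle — which would directly address the heat issue raised above.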