abetlen/llama-cpp-python

Specify GPU Selection (e.g., CUDA:0, CUDA:1)


Hi,

Is there a way to specify which GPU to use for inference, such as restricting it to only cuda:0 or cuda:1 in the code? Or are there any workarounds for achieving this?

Thanks in advance.

You can use tensor_split=[1, 0, 0] to ignore cuda:1 and cuda:2 and keep everything on cuda:0.

Also set split_mode to none to improve performance, since the model stays on a single GPU.

Hi @ExtReMLapin, thanks for your reply!
I tried what you suggested but got stuck, so could you please elaborate in more detail?

They are arguments of the Llama class.
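Something like this, a minimal untested sketch: the model path and the three-GPU layout are just placeholders for your own setup.

```python
from llama_cpp import Llama
import llama_cpp

# Keep the whole model on cuda:0: give the other GPUs a zero share in
# tensor_split and disable splitting across devices with split_mode.
llm = Llama(
    model_path="./models/model.gguf",            # hypothetical path, point this at your GGUF file
    n_gpu_layers=-1,                             # offload all layers to the GPU
    tensor_split=[1, 0, 0],                      # all weights to cuda:0, nothing on cuda:1/cuda:2
    split_mode=llama_cpp.LLAMA_SPLIT_MODE_NONE,  # don't split the model across GPUs
)

print(llm("Q: Name the planets in the solar system. A:", max_tokens=32)["choices"][0]["text"])
```

If I recall correctly, with split_mode set to none the main_gpu argument (default 0) is what ultimately picks the device, so make sure it matches the GPU you gave the non-zero share in tensor_split.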