Run neural-chat 7b inference with Deepspeed on Flex 140. #10507
Closed this issue · 4 comments
weiseng-yeap commented
plusbang commented
Hi, @weiseng-yeap , we had some update in our env-check script. Could you please try the new script (https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/scripts) and attach related information?
Besides, the used GPU memory is mainly related to the model size, applied precision and input length. According to your screenshot, the GPU power is very low.
weiseng-yeap commented
Hi BinBin
Using latest script with attached latest log.
Env_V2.txt
plusbang commented
Using latest script with attached latest log. Env_V2.txt
It seems intel-fw-gpu
and intel-i915-dkms
is not installed. Please try sudo apt install intel-i915-dkms intel-fw-gpu
first.
weiseng-yeap commented
Attached with latest env.
Uploading Env_V3.txt…