deep-diver/LLM-As-Chatbot

Internet search is very slow using orca mini on 4bit in google colab T4 on gradio

Opened this issue · 3 comments

i waited for more than 15 minutes and didnt get the response

which orcamini did you choose? 7B or 13B?

i chose 13B and loaded it in 4bit