deep-diver/LLM-As-Chatbot

bug in chatbot UI

GeorvityLabs opened this issue · 27 comments

When I say "hello", after I send the first sentence the text input box gets blocked by the loading animation.
I'm not able to enter anything!
I attached a screenshot below. Any idea why this issue occurs in the chatbot UI?

Also, can you add a "reset" button to reset the chat?

Screenshot from 2023-03-28 22-50-28

After about five or six questions it bugs out. Any idea why this happens?
Screenshot from 2023-03-28 22-55-12

@deep-diver any idea why these bugs show up in the chatbot UI?

Sorry about that. Probably I need to understand Gradio better!

Will have a look into this case. Thanks for letting me know!

By the way, I am hosting this on 3×A6000 now. Please try it if you are interested:

https://notebooksf.jarvislabs.ai/BuOu_VbEuUHb09VEVHhfnFq4-PMhBRVCcfHBRCOrq7c4O9GI4dIGoidvNf76UsRL/

@deep-diver is the current Colab example in batch mode or streaming mode?
Is there a difference in the code?
If the Colab example is in batch mode, how can it be converted to streaming mode?

The default mode is streaming. If you want batch mode, set --batch_size higher than 1.
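(For anyone wondering what the difference looks like in code: a minimal sketch, assuming a Hugging Face transformers model is already loaded; the function and variable names here are illustrative, not the repo's actual code. Batch mode returns the full decoded output once, while streaming mode yields partial text as tokens arrive, which is what lets the Gradio chatbot update incrementally.)

```python
# Illustrative sketch: batch vs. streaming generation with transformers.
# Assumes `model` and `tokenizer` are already loaded; names are hypothetical.
from threading import Thread
from transformers import TextIteratorStreamer

def generate_batch(model, tokenizer, prompt):
    # Batch mode: return the whole completion at once.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=256)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)

def generate_stream(model, tokenizer, prompt):
    # Streaming mode: yield the growing text as tokens are produced.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    streamer = TextIteratorStreamer(tokenizer, skip_prompt=True,
                                    skip_special_tokens=True)
    thread = Thread(target=model.generate,
                    kwargs=dict(**inputs, streamer=streamer, max_new_tokens=256))
    thread.start()
    partial = ""
    for new_text in streamer:
        partial += new_text
        yield partial  # a Gradio generator handler re-renders on every yield
```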

input_box_issues.mp4

As you can see in the video above, the input box is blocked by the loading animation. Any idea why this happens?

no_outputs_uibug.mp4

This is another UI bug where the context box is also filled with the loading animation. Is there any way to disable the loading animation @deep-diver?
Also, no response is obtained from the model, even after waiting for minutes.

Are you using Colab? That happens. It looks like the connection is not stable in Colab.


No, not Colab. I am using it on my local machine.
Is there any way to fix this, if it is a connection issue?

But I did see the same errors while using Colab as well @deep-diver

And I also sometimes saw the same issues on the Jarvis Labs AI page as well @deep-diver

@deep-diver any idea how to get it functioning properly inside Colab? Were you able to run tests and check whether the results were stable inside Colab?
I think the UI instabilities carry over to when we run on a local machine too; there is some general instability in the chatbot UI.

I just added a cancel button.
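(For reference, a minimal Gradio sketch of how a cancel button can interrupt an in-flight streaming handler via the `cancels` argument on an event listener; the component and function names are illustrative, not necessarily how this repo wires it.)

```python
import gradio as gr

def chat_fn(message, history):
    # Illustrative streaming handler: yields the growing reply.
    history = history + [(message, "")]
    for chunk in fake_token_stream(message):  # hypothetical token source
        history[-1] = (message, history[-1][1] + chunk)
        yield history

with gr.Blocks() as demo:
    chatbot = gr.Chatbot()
    textbox = gr.Textbox()
    cancel_btn = gr.Button("Cancel")

    submit_event = textbox.submit(chat_fn, [textbox, chatbot], chatbot)
    # Clicking Cancel stops the running generation event (requires queueing).
    cancel_btn.click(fn=None, inputs=None, outputs=None, cancels=[submit_event])

demo.queue().launch()
```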

OK, that is great @deep-diver. I will run some tests and check how it functions now.


@deep-diver can you also add a reset button, so that we can reset the chat, similar to how Bing Chat has a reset option?

Sounds good! Will try

@deep-diver any update on the reset button? Similar to ChatGPT, to reset the chat / start a new conversation.

Not yet

Sorry about that. I am currently busy experimenting with the 65B model.

I am thinking about having a history tab instead of reset, like you log in with a Google account.

@deep-diver is there a way to measure the inference speed, like how many tokens/sec on a given GPU for a given model?

You can write your own code to simply count how many tokens are yielded within a certain time window.
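(A rough sketch of what that could look like, assuming a transformers streamer; the setup and names are illustrative, and the count is approximate because each streamed chunk is re-tokenized.)

```python
import time
from threading import Thread
from transformers import TextIteratorStreamer

def measure_tokens_per_sec(model, tokenizer, prompt, max_new_tokens=256):
    """Count generated tokens over wall-clock time for one prompt."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    streamer = TextIteratorStreamer(tokenizer, skip_prompt=True)
    thread = Thread(target=model.generate,
                    kwargs=dict(**inputs, streamer=streamer,
                                max_new_tokens=max_new_tokens))
    start = time.perf_counter()
    thread.start()
    n_tokens = 0
    for text_chunk in streamer:
        # Approximate: re-tokenize each decoded chunk to count its tokens.
        n_tokens += len(tokenizer(text_chunk, add_special_tokens=False)["input_ids"])
    elapsed = time.perf_counter() - start
    return n_tokens / elapsed
```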

Just added a reset button.
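(For reference, a minimal Gradio sketch of what a reset button can look like: a click handler that clears the chatbot history and the textbox. Component names are illustrative, not necessarily the repo's; any conversation state kept server-side would need to be cleared in the same handler.)

```python
import gradio as gr

with gr.Blocks() as demo:
    chatbot = gr.Chatbot()
    textbox = gr.Textbox()
    reset_btn = gr.Button("Reset")

    # Returning an empty history and an empty string clears both components.
    reset_btn.click(lambda: ([], ""), inputs=None, outputs=[chatbot, textbox])

demo.launch()
```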

@deep-diver which dataset did you use to train this model: chansung/gpt4-alpaca-lora-13b?
Is the dataset available on Hugging Face?

I used the GPT-4-generated dataset introduced in the "Instruction Tuning with GPT-4" paper. You can find the dataset in the official repo.

@deep-diver, I have a GPU with 40GB of VRAM.
If I run LLM-As-Chatbot with Llama 7B and the AlpacaGPT4 LoRA, how many instances can I run in parallel on a single GPU?

I don't want to use any queues; I want to check how many instances I can run on a single GPU at a time, in parallel.

Hope you can shed some light on this.

Also @deep-diver, I'm currently running multiple instances in parallel on a single GPU by creating separate Docker containers for each instance of the chatbot (that way I get a unique Gradio link for each instance). Is there a better way to run multiple LLM-As-Chatbot instances in parallel on a single GPU?

@GeorvityLabs

Sorry, I am not sure about this question. Even if you containerize, I/O blocking should still be there because there is a single GPU globally available. A better solution would be to logically isolate a single GPU as if there were multiple physical GPUs.
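(One way to approximate that logical isolation in PyTorch, assuming each chatbot process should only see a slice of the 40GB card, is to cap each process's memory fraction; the value below is illustrative, and this only partitions memory, so compute is still time-sliced between instances. On hardware that supports it, MIG gives a harder split at the driver level.)

```python
import torch

# Hypothetical per-instance cap: let this process use ~25% of GPU 0's memory,
# e.g. when running four chatbot containers against the same physical card.
# Compute is still shared, so throughput per instance will drop under load.
torch.cuda.set_per_process_memory_fraction(0.25, device=0)
```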

I am going to close this issue for now. Please post anything you are wondering about in the Discussions menu. I think that is a better place :)