ParisNeo/lollms-webui

How to support _Falcon-40b-Instruct.ggmlv3.q4_0.bin_ ?

benoit-cty opened this issue · 8 comments

Thanks for this tool!

How can I use it with Falcon-40b-Instruct.ggmlv3.q4_0.bin ?

I can run Falcon-40b-Instruct.ggmlv3.q4_0.bin locally with:

git clone https://github.com/jploski/ggml falcon-ggml
cd falcon-ggml
git checkout falcon40b
mkdir build && cd build && cmake .. && cmake --build . --config Release
bin/falcon -m ~/work/Falcon-40b-Instruct.ggmlv3.q4_0.bin -t 10 -n 200 -p "Hello AGI"

That should be possible via ctransformers, but it is currently broken. Please close this issue and refer to #309.

Thanks, waiting for #309

All Falcon models are now supported in lollms-webui through the ctransformers bindings. You can either install one of the models I have put in the UI, or use the URL install if the model is not in the zoo: just grab the link to the .bin file from a repo like @TheBloke's Hugging Face page, paste it into the URL field, and press install.
The ctransformers binding supports loads of models. It is my favorite.
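For reference, loading a GGML Falcon model through ctransformers directly looks roughly like this. This is a minimal sketch, not lollms-webui's own loading code; the model path is a placeholder and it assumes a ctransformers version with Falcon support installed:

```python
def load_falcon(model_path):
    """Load a GGML Falcon model via ctransformers (sketch; path is hypothetical)."""
    # Import inside the function so defining it does not require ctransformers.
    from ctransformers import AutoModelForCausalLM

    # model_type="falcon" tells ctransformers which GGML architecture to expect.
    return AutoModelForCausalLM.from_pretrained(model_path, model_type="falcon")


# Hypothetical usage with the file from this thread:
# llm = load_falcon("~/work/Falcon-40b-Instruct.ggmlv3.q4_0.bin")
# print(llm("Hello AGI", max_new_tokens=200))
```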

Yes, it works, thanks! But it seems too slow in the UI: #310

The 40B model is too big for most GPUs, so many layers end up handled by the CPU. Use the 7B model instead; with GPU support it runs fast.
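In ctransformers, the CPU/GPU split mentioned above is controlled by the `gpu_layers` option of `from_pretrained`; a rough sketch, assuming a GPU-enabled ctransformers build (the layer count here is an arbitrary example, not a recommendation):

```python
def load_falcon_with_gpu(model_path, gpu_layers=50):
    """Sketch: offload transformer layers to the GPU via ctransformers."""
    from ctransformers import AutoModelForCausalLM

    # gpu_layers sets how many layers are placed on the GPU; layers that
    # do not fit stay on the CPU, which is what makes the 40B model slow.
    return AutoModelForCausalLM.from_pretrained(
        model_path,
        model_type="falcon",
        gpu_layers=gpu_layers,
    )
```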