Starcoder conversion and quantization instructions

Question


aseok opened this issue 2 years ago · 4 comments

Hi.
Please provide conversion and quantization instructions for the main StarCoder model files.

Answer 1 · 2023-05-20T15:39:55.000Z

You can find the instructions here: https://github.com/bigcode-project/starcoder.cpp#quick-start

Answer 2 · 2023-05-23T10:01:21.000Z

I have downloaded the model files separately, so I skip downloading them in convert-hf-to-ggml.py. My problem is with quantization, and probably with running inference: how do I pass the model files to the quantization command? Should I rename them?

Answer 3 · 2023-06-06T20:03:05.000Z

how to pass the model files in quantization command?

For the sharded model conversion, don't pass the individual filenames; pass the directory that contains them:
$ python convert-hf-to-ggml.py ./starcoder

Then quantize the converted file as in the README example:
$ ./quantize starcoder-ggml.bin starcoder-ggml-q4_1.bin 3
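
To make the order of the two steps explicit, here is a minimal Python sketch that only assembles and prints the same commands (a dry run by default). It assumes the conversion writes starcoder-ggml.bin where the quantize step expects it, that the quantize binary sits in the current directory, and that type 3 selects q4_1, as the output filename in the README example suggests. The helper names (conversion_command, quantization_command, run_pipeline) are hypothetical and not part of starcoder.cpp.

```python
import shlex
import subprocess


def conversion_command(model_dir):
    """Build the conversion command for a local checkpoint directory."""
    # Pass the directory itself, not the individual shard files.
    return ["python", "convert-hf-to-ggml.py", str(model_dir)]


def quantization_command(src, dst, ftype=3):
    """Build the quantization command (type 3 is assumed to mean q4_1)."""
    return ["./quantize", str(src), str(dst), str(ftype)]


def run_pipeline(model_dir,
                 ggml_file="starcoder-ggml.bin",          # assumed conversion output
                 quantized_file="starcoder-ggml-q4_1.bin",
                 dry_run=True):
    """Print both commands in order; execute them only if dry_run is False."""
    commands = [
        conversion_command(model_dir),
        quantization_command(ggml_file, quantized_file),
    ]
    for cmd in commands:
        print("$ " + shlex.join(cmd))
        if not dry_run:
            # check=True stops the pipeline if a step fails.
            subprocess.run(cmd, check=True)
    return commands


# Dry run: shows the commands without executing anything.
run_pipeline("./starcoder")
```

If the conversion script writes its output somewhere else on your setup, adjust ggml_file to match before calling run_pipeline with dry_run=False.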

Answer 4 · 2023-06-08T17:45:22.000Z

Does that fix your issue @aseok?