princeton-nlp/LLM-Shearing

ShearedCodeLLama

Closed this issue · 3 comments

Hi! I am working on a copilot backend and, even though I am using a GPTQ quant of CodeLlama-7B, it still eats a lot of VRAM.
DeepSeek Coder seems to have severe issues with fill-in-the-middle.

I wanted to ask if you plan on also shearing CodeLlama? :)

I also want to shear CodeLlama. Did you succeed?

@YanxiZSQ
It would cost around $2k given the estimate in #22

I just fixed my DeepSeek prompt and I have to say these are great models. Closing!
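For context on the prompt fix above: infill-capable code models expect the prompt assembled from special sentinel tokens around the code before and after the cursor, and getting the order or tokens wrong makes fill-in-the-middle output look broken. A minimal sketch, assuming hypothetical StarCoder-style sentinel names; the exact token strings must be taken from the target model's tokenizer config:

```python
def build_fim_prompt(
    prefix: str,
    suffix: str,
    begin: str = "<fim_prefix>",   # placeholder sentinel, model-specific
    hole: str = "<fim_suffix>",    # placeholder sentinel, model-specific
    end: str = "<fim_middle>",     # placeholder sentinel, model-specific
) -> str:
    """Assemble a prefix-suffix-middle (PSM) fill-in-the-middle prompt.

    The model is expected to generate the code that belongs between
    `prefix` and `suffix`, i.e. the text following the `end` sentinel.
    """
    return f"{begin}{prefix}{hole}{suffix}{end}"


# Example: ask the model to fill in a function body.
prompt = build_fim_prompt(
    prefix="def add(a, b):\n    ",
    suffix="\n\nprint(add(1, 2))\n",
)
print(prompt)
```

This is only a sketch of the general PSM layout, not the exact format any specific model uses; check the model card or tokenizer for the real special tokens.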