adriacabeza/erudito

Can we get this to run on OOba 4bit quantized models?

bbecausereasonss opened this issue · 0 comments

That would be amazing :)