htcml opened this issue 2 years ago · 0 comments
Could you add quantization code so that the model can run on a smaller GPU?
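For context, here is a minimal sketch of the idea behind weight quantization, in plain Python. It is not tied to this repository's model or to any particular library: `quantize_int8` and `dequantize_int8` are hypothetical helper names, and the weight values are made up for illustration. It assumes symmetric per-tensor int8 quantization, which is only one of several possible schemes.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization: w is approximated by scale * q, with q in [-127, 127]."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Fall back to a scale of 1.0 for an all-zero (or empty) tensor to avoid dividing by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # array("b") stores signed 8-bit integers, one byte per weight.
    q = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate float weights from the int8 values and the shared scale."""
    return [scale * v for v in q]


# Illustrative values only, not taken from any real model.
weights = [0.12, -0.5, 0.33, 1.0, -0.98]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)
```

Each weight is stored in one byte instead of four (float32), at the cost of a rounding error of at most about half of `scale` per weight. Running a real model this way on a GPU would need framework support for low-precision kernels, which this sketch does not cover.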