granularai/fabric

Model compression

sagarverma opened this issue · 0 comments

Compress model using

  1. Once for all for variable deployment.
  2. Filter pruning methods
  3. Quantization
  4. CI-CD for CPP converstion