Model compression techniques with differentiable neural architecture search.
Currently, pruning and quantization are supported.
This project is implemented based on FBNet reproduced version.
Model compression techniques with differentiable neural architecture search.
Currently, pruning and quantization are supported.
This project is implemented based on FBNet reproduced version.