Apply model compression techniques for improved throughput
tallamjr opened this issue · 0 comments
tallamjr commented
The goal is to apply techniques found in https://www.tensorflow.org/model_optimization/api_docs/python/tfmot to improve throughput and latency of inference for t2
.