TensorRT saved model too large to use with TFServing
bharatv007 opened this issue · 4 comments
bharatv007 commented
Versions:
TensorFlow: 2.3.0-rc1
CUDA: 10
TensorRT: 6
I am trying to convert a GPT-2 model, and the resulting saved model is about 1.9 GB. This causes a problem when I try to deploy it with TF Serving, because it hits a protobuf limit of 1 GB. I also tried skipping building the TRT engines before deployment, but that did not reduce the size of saved_model.pb.
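For context, protobuf caps a single serialized message at 2 GiB, and the limit this issue reports hitting in TF Serving is 1 GB, so a 1.9 GB saved_model.pb lands over that threshold regardless of whether engines are pre-built. A minimal sketch of a pre-deployment size check (the path argument and the exact 1 GB figure are assumptions taken from this report, not a documented TF Serving constant):

```python
import os

# Limit as reported in this issue; protobuf's hard cap on one
# serialized message is 2 GiB, and the serving-side limit hit
# here is reported as 1 GB.
REPORTED_LIMIT_BYTES = 1 * 1024**3

def fits_protobuf_limit(pb_path: str, limit: int = REPORTED_LIMIT_BYTES) -> bool:
    """Return True if the graph proto at pb_path is under `limit` bytes."""
    return os.path.getsize(pb_path) <= limit
```

Note that with TF-TRT's `TrtGraphConverterV2`, calling `converter.build()` before `converter.save()` embeds the built engines in the SavedModel, while skipping `build()` defers engine construction to runtime; as the report above observes, the model weights alone keep saved_model.pb near 1.9 GB either way.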
ericxsun commented
I hit the same problem on tf-2.4.1.
forrest0402 commented
I am hitting this problem as well.