Model size are not reduced after simplification
junde-cadence opened this issue · 2 comments
junde-cadence commented
I tried to simplify TinyLlama with the code, but the simplified onnx file is almost with the same size with non-simplified one. It is appreciated if you can provide onnx sizes of the original Llama onnx model and the one after simplification.
luchangli03 commented
maybe you export onnx model without using const folding
luchangli03 commented
please try the new code