justairr opened this issue a year ago · 1 comments
In the Quantization part, the hyperlink to the paper 'FlexRound' and 'Understanding INT4 Quantization for Transformer Models' is incorrect.
Thanks for pointing this out! We have replaced it with the correct hyperlink accordingly.