microsoft/TransformerCompression

QuaRot: DeepSeek-V2 Support

RanchiZhao opened this issue · 0 comments

Is DeepSeek-V2 support planned for QuaRot? In particular, how should Q be applied to MLA (Multi-head Latent Attention)?