/shortened-llm

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Primary Language: Python
