Image Diffusion block merging technique applied to transformers based Language Models.
Primary LanguagePython