Memory compressed attention
lucidrains opened this issue · 0 comments
I have an implementation of the memory-compressed attention from the "Generating Wikipedia" paper at https://github.com/lucidrains/memory-compressed-attention. Also, I wanted to let you know there is a more complete implementation of Linformer by Peter here: https://github.com/tatp22/linformer-pytorch. Thank you for compiling this!