Memory compressed attention

Question

lucidrains opened this issue 4 years ago · 0 comments

I have the memory compressed attention from the "Generating Wikipedia" paper https://github.com/lucidrains/memory-compressed-attention . Also, wanted to let you know there is a more complete implementation of linformer by Peter here https://github.com/tatp22/linformer-pytorch Thank you for compiling this!