linear-attention
There are 17 repositories under the linear-attention topic.
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), so it combines the best of the RNN and the transformer: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
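For context, here is a minimal PyTorch sketch of the generic causal linear-attention recurrence that lets models in this family run as an RNN at inference time. This is not RWKV's actual time-mixing formula; the feature map, shapes, and function name are illustrative assumptions.

import torch

def causal_linear_attention(q, k, v, eps=1e-6):
    # q, k, v: (seq_len, dim); phi is a simple positive feature map (assumption)
    phi = lambda x: torch.nn.functional.elu(x) + 1
    q, k = phi(q), phi(k)
    state = torch.zeros(k.shape[-1], v.shape[-1])   # running sum of outer(k_t, v_t)
    norm = torch.zeros(k.shape[-1])                 # running sum of k_t
    outs = []
    for q_t, k_t, v_t in zip(q, k, v):              # O(1) state update per token
        state = state + torch.outer(k_t, v_t)
        norm = norm + k_t
        outs.append(state.T @ q_t / (norm @ q_t + eps))
    return torch.stack(outs)

out = causal_linear_attention(torch.randn(8, 16), torch.randn(8, 16), torch.randn(8, 16))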
happinesslz/LION
[NeurIPS 2024] Official code of "LION: Linear Group RNN for 3D Object Detection in Point Clouds"
lucidrains/taylor-series-linear-attention
Explorations into the recently proposed Taylor Series Linear Attention
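The underlying idea, sketched here under assumptions rather than taken from that repository: exp(q·k) is approximated by its second-order Taylor expansion, 1 + q·k + (q·k)²/2, which factors into a feature map so attention can be computed in linear time (non-causal case shown for brevity; names are illustrative).

import torch

def taylor_feature_map(x):
    # phi(x) = [1, x, vec(x x^T)/sqrt(2)]  =>  phi(q).phi(k) = 1 + q.k + (q.k)^2 / 2
    ones = torch.ones(*x.shape[:-1], 1)
    second = torch.einsum('...i,...j->...ij', x, x).flatten(-2) / 2 ** 0.5
    return torch.cat([ones, x, second], dim=-1)

def taylor_linear_attention(q, k, v, eps=1e-6):
    q, k = taylor_feature_map(q), taylor_feature_map(k)
    kv = torch.einsum('nd,ne->de', k, v)    # sum_t phi(k_t) v_t^T
    z = k.sum(dim=0)                        # sum_t phi(k_t), for normalization
    return (q @ kv) / (q @ z + eps).unsqueeze(-1)

out = taylor_linear_attention(torch.randn(8, 16), torch.randn(8, 16), torch.randn(8, 32))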
lucidrains/agent-attention-pytorch
Implementation of Agent Attention in Pytorch
lironui/Multi-Attention-Network
Semantic segmentation of remote sensing images
lironui/MAResU-Net
Semantic segmentation of remote sensing images
lucidrains/autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
glassroom/heinsen_attention
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
gmongaras/Cottention_Transformer
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
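A rough sketch of the idea suggested by the title (the actual Cottention layer may differ): replacing softmax with cosine similarity makes the attention kernel a plain dot product of unit-norm vectors, so (Q Kᵀ) V can be reordered as Q (Kᵀ V) for linear cost in sequence length.

import torch
import torch.nn.functional as F

def cosine_linear_attention(q, k, v):
    q = F.normalize(q, dim=-1)              # unit-norm queries
    k = F.normalize(k, dim=-1)              # unit-norm keys
    return q @ (k.transpose(-2, -1) @ v)    # O(n * d^2) instead of O(n^2 * d)

out = cosine_linear_attention(torch.randn(8, 16), torch.randn(8, 16), torch.randn(8, 32))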
BICLab/MetaLA
Official implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS 2024)
robflynnyh/hydra-linear-attention
Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)
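A hedged sketch of the Hydra attention identity described in that paper: with as many heads as feature dimensions and a cosine-similarity feature map, attention collapses to elementwise gating by a single global key-value summary, giving O(n·d) cost. Shapes and naming below are assumptions, not the repository's API.

import torch
import torch.nn.functional as F

def hydra_attention(q, k, v):
    # q, k, v: (seq_len, dim), with one "head" per feature dimension
    q = F.normalize(q, dim=-1)
    k = F.normalize(k, dim=-1)
    kv = (k * v).sum(dim=0)                 # global summary: sum_t k_t * v_t (elementwise)
    return q * kv                           # each token gates the summary elementwise

out = hydra_attention(torch.randn(8, 16), torch.randn(8, 16), torch.randn(8, 16))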
OSU-STARLAB/LeaPformer
[ICML 2024] Official implementation of "LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions."
RWKV-Wiki/rwkv-wiki.github.io
RWKV Wiki website (archived, please visit official wiki)
hp-l33/flash-bidirectional-linear-attention
Triton implementation of bidirectional (non-causal) linear attention
gmlwns2000/sea-attention
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
mtanghu/LEAP
LEAP: Linear Explainable Attention in Parallel for causal language modeling with O(1) path length and O(1) inference
Rushi314/Transformers-for-high-resolution-image-synthesis
Taming Transformers for High-Resolution Image Synthesis