attention-mechanisms
There are 93 repositories under the attention-mechanisms topic.
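For orientation, the operation nearly all of these repositories build on is scaled dot-product attention. A minimal PyTorch sketch (tensor shapes and the `attention` name are illustrative, not taken from any repo below):

```python
import torch

def attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, head_dim)
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
    if mask is not None:
        scores = scores.masked_fill(~mask, float('-inf'))
    weights = scores.softmax(dim=-1)   # rows sum to 1: one distribution per query
    return weights @ v                 # attention-weighted sum of values

q = k = v = torch.randn(1, 8, 128, 64)  # batch 1, 8 heads, 128 tokens, head_dim 64
out = attention(q, k, v)                # -> (1, 8, 128, 64)
```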
lucidrains/PaLM-rlhf-pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list on Vision Transformers and attention, including papers, code, and related websites
lucidrains/musiclm-pytorch
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
lucidrains/toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by Meta AI
lucidrains/make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
lucidrains/alphafold3-pytorch
Implementation of AlphaFold 3 from Google DeepMind in Pytorch
pprp/awesome-attention-mechanism-in-cv
Awesome List of Attention Modules and Plug&Play Modules in Computer Vision
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
lucidrains/meshgpt-pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
lucidrains/phenaki-pytorch
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch
kyegomez/LongNet
Implementation of the plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
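LongNet's core trick is dilated attention: split the sequence into segments and attend only among every r-th token inside each segment, keeping cost linear in sequence length. A rough single-rate sketch, assuming seq_len divides evenly (the paper runs several segment-length/dilation pairs in parallel so every position is covered; function and argument names here are illustrative):

```python
import torch
import torch.nn.functional as F

def dilated_attention(q, k, v, segment_len=8, dilation=2):
    # q, k, v: (batch, seq_len, dim); seq_len divisible by segment_len (assumed)
    b, n, d = q.shape
    out = torch.zeros_like(q)
    # view the sequence as (batch, num_segments, segment_len, dim)
    qs, ks, vs = (t.view(b, n // segment_len, segment_len, d) for t in (q, k, v))
    idx = torch.arange(0, segment_len, dilation)   # keep every dilation-th token
    attn = F.scaled_dot_product_attention(qs[:, :, idx], ks[:, :, idx], vs[:, :, idx])
    # scatter the sparse outputs back; positions this rate skips stay zero --
    # in LongNet, other (segment_len, dilation) pairs attend to them instead
    out.view(b, n // segment_len, segment_len, d)[:, :, idx] = attn
    return out
```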
JulesBelveze/time-series-autoencoder
PyTorch Dual-Attention LSTM-Autoencoder For Multivariate Time Series
lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
lucidrains/BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
lucidrains/iTransformer
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks, out of Tsinghua / Ant group
changzy00/pytorch-attention
🦖Pytorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥
lucidrains/local-attention
An implementation of local windowed attention for language modeling
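Local windowed attention restricts each query to a fixed-size neighborhood rather than the full sequence. A toy masked version to show the idea (this still materializes the full n-by-n score matrix; the library instead buckets the sequence into windows so compute and memory stay linear, and adds causal masking and relative positions):

```python
import torch

def local_attention(q, k, v, window=16):
    # q, k, v: (batch, seq_len, dim); each query sees only tokens within `window`
    n = q.size(-2)
    pos = torch.arange(n)
    mask = (pos[:, None] - pos[None, :]).abs() < window   # band around the diagonal
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)
    scores = scores.masked_fill(~mask, float('-inf'))
    return scores.softmax(dim=-1) @ v
```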
lucidrains/robotic-transformer-pytorch
Implementation of RT1 (Robotic Transformer) in Pytorch
lucidrains/mmdit
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
landskape-ai/triplet-attention
Official PyTorch Implementation for "Rotate to Attend: Convolutional Triplet Attention Module." [WACV 2021]
lucidrains/recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in Pytorch
lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google DeepMind
lucidrains/medical-chatgpt
Implementation of ChatGPT, but tailored to primary care medicine, where the reward is collecting patient histories thoroughly and efficiently and arriving at a reasonable differential diagnosis
lucidrains/equiformer-pytorch
Implementation of the Equiformer, SE3/E3 equivariant attention network that reaches new SOTA, and adopted for use by EquiFold for protein folding
cbaziotis/neat-vision
Neat (Neural Attention) Vision is a visualization tool for the attention mechanisms of deep-learning models on Natural Language Processing (NLP) tasks (framework-agnostic)
lucidrains/CoLT5-attention
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
vene/sparse-structured-attention
Sparse and structured neural attention mechanisms
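One representative mechanism from this line of work is sparsemax (Martins & Astudillo, 2016): a drop-in softmax replacement that projects scores onto the probability simplex, producing attention weights that are exactly zero outside a small support. A minimal sketch over the last dimension (names are illustrative):

```python
import torch

def sparsemax(scores):
    # Euclidean projection of scores onto the probability simplex
    z, _ = scores.sort(dim=-1, descending=True)
    cumsum = z.cumsum(-1) - 1
    k = torch.arange(1, scores.size(-1) + 1, device=scores.device, dtype=scores.dtype)
    support = (k * z > cumsum).sum(dim=-1, keepdim=True)   # support size k*
    tau = cumsum.gather(-1, support.long() - 1) / support  # threshold
    return (scores - tau).clamp(min=0)

w = sparsemax(torch.tensor([[2.0, 1.5, 0.1]]))
# w = [[0.75, 0.25, 0.0]]: sums to 1, and the low-scoring entry is exactly zero,
# whereas softmax would assign it small but nonzero weight
```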
lucidrains/block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
GiantPandaCV/yolov3-point
Learning the YOLOv3 codebase from scratch
lucidrains/flash-cosine-sim-attention
Implementation of fused cosine similarity attention in the same style as Flash Attention
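Cosine-similarity attention replaces the usual 1/sqrt(head_dim) scaling with l2-normalized queries and keys plus a fixed temperature, which bounds the logits and simplifies the numerics of a fused kernel. The repository's contribution is that fused kernel; this unfused sketch only shows the math, and scale=10 is an assumption for illustration:

```python
import torch
import torch.nn.functional as F

def cosine_sim_attention(q, k, v, scale=10.0):
    # l2-normalize q and k so every logit is a bounded cosine similarity,
    # then apply a fixed temperature instead of 1/sqrt(head_dim)
    q, k = F.normalize(q, dim=-1), F.normalize(k, dim=-1)
    return ((q @ k.transpose(-2, -1)) * scale).softmax(dim=-1) @ v
```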
lucidrains/simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
lucidrains/flash-attention-jax
Implementation of Flash Attention in Jax
lucidrains/Mega-pytorch
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
lucidrains/recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch