attention-is-all-you-need
There are 221 repositories under the attention-is-all-you-need topic.
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
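As a point of reference for the implementations listed on this page, here is a minimal sketch (not taken from any of these repositories) of the scaled dot-product attention at the core of "Attention Is All You Need", Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V, written in PyTorch:

import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_k)
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5      # (batch, heads, len_q, len_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = F.softmax(scores, dim=-1)                 # attention weights over keys
    return weights @ v, weights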
Kyubyong/transformer
A TensorFlow Implementation of the Transformer: Attention Is All You Need
linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
awslabs/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
gordicaleksa/pytorch-original-transformer
My implementation of the original Transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise hard-to-grasp concepts. IWSLT pretrained models are currently included.
Separius/awesome-fast-attention
A list of efficient attention modules
brightmart/bert_language_understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
kaituoxu/Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
hkproj/pytorch-transformer
An implementation of "Attention Is All You Need"
lsdefine/attention-is-all-you-need-keras
A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need
kyegomez/LongNet
Implementation of the plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
sooftware/kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
FreedomIntelligence/TextClassificationBenchmark
A Benchmark of Text Classification in PyTorch
jayparks/transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
lvapeab/nmt-keras
Neural Machine Translation with Keras
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for long-context Transformer model training and inference
kyegomez/CM3Leon
An open-source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multimodal AI that uses just a decoder to generate both text and images
kyegomez/ScreenAI
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
sled-group/InfEdit
[CVPR 2024] Official implementation of the paper "Inversion-Free Image Editing with Natural Language"
kyegomez/PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
sgrvinod/a-PyTorch-Tutorial-to-Transformers
Attention Is All You Need | a PyTorch Tutorial to Transformers
leviswind/pytorch-transformer
A PyTorch implementation of "Attention Is All You Need"
hkproj/transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
brandokoch/attention-is-all-you-need-paper
Implementation of the original Transformer paper: Vaswani, Ashish, et al. "Attention Is All You Need." Advances in Neural Information Processing Systems, 2017.
kyegomez/RT-X
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
kyegomez/MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
Shuijing725/CrowdNav_Prediction_AttnGraph
[ICRA 2023] Intention Aware Robot Crowd Navigation with Attention-Based Interaction Graph
kyegomez/Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
guillaume-chevalier/Linear-Attention-Recurrent-Neural-Network
A recurrent attention module consisting of an LSTM cell that can query its own past cell states by means of windowed multi-head attention. The formulas are derived from the BN-LSTM and the Transformer network. The LARNN cell with attention can be used inside a loop over the cell state, just like any other RNN cell.
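A rough sketch of the idea described above, using a standard nn.LSTMCell and nn.MultiheadAttention; the way the attention output is mixed back into the cell state below is an assumption, and the actual LARNN formulas differ in detail:

import torch
import torch.nn as nn

class WindowedAttentionLSTMCell(nn.Module):
    def __init__(self, input_size, hidden_size, num_heads=4, window=16):
        super().__init__()
        self.cell = nn.LSTMCell(input_size, hidden_size)
        self.attn = nn.MultiheadAttention(hidden_size, num_heads, batch_first=True)
        self.window = window

    def forward(self, x, state, past_cells):
        # x: (batch, input_size); state: (h, c); past_cells: list of past cell states
        h, c = self.cell(x, state)
        if past_cells:
            mem = torch.stack(past_cells[-self.window:], dim=1)  # (batch, window, hidden)
            ctx, _ = self.attn(h.unsqueeze(1), mem, mem)          # query past cells with h
            c = c + ctx.squeeze(1)                                # assumed mixing step
        past_cells.append(c)
        return h, c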
OutofAi/StableFace
Build your own Face App with Stable Diffusion 2.1
tnq177/transformers_without_tears
Transformers without Tears: Improving the Normalization of Self-Attention
jshuadvd/LongRoPE
Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
alexivaner/Deep-Learning-Based-Radio-Signal-Classification
Final Project for AI Wireless
johnsmithm/multi-heads-attention-image-classification
Multi-head attention for image classification
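A hedged sketch of this general approach (the patch size, embedding dimension, and mean-pooling below are assumptions, not the repository's exact model): run multi-head self-attention over image patches, then classify the pooled result.

import torch
import torch.nn as nn

class TinyAttentionClassifier(nn.Module):
    def __init__(self, patch_dim=16 * 16 * 3, embed_dim=128, num_heads=4, num_classes=10):
        super().__init__()
        self.embed = nn.Linear(patch_dim, embed_dim)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, patches):
        # patches: (batch, num_patches, patch_dim), e.g. an image cut into 16x16 tiles
        x = self.embed(patches)
        x, _ = self.attn(x, x, x)         # self-attention across patches
        return self.head(x.mean(dim=1))   # mean-pool over patches, then classify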
kyegomez/Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"