multi-head-attention
There are 37 repositories under the multi-head-attention topic.
sooftware/attentions
PyTorch implementations of several attention mechanisms for deep learning researchers.
imperial-qore/TranAD
[VLDB'22] Anomaly Detection using Transformers, self-conditioning and adversarial training.
anicolson/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
poloclub/dodrio
Exploring attention weights in transformer-based models with linguistic knowledge.
Rintarooo/VRP_DRL_MHA
"Attention, Learn to Solve Routing Problems!"[Kool+, 2019], Capacitated Vehicle Routing Problem solver
monk1337/Various-Attention-mechanisms
This repository contains various types of attention mechanisms, such as Bahdanau, soft, additive, and hierarchical attention, in PyTorch, TensorFlow, and Keras.
datnnt1997/multi-head_self-attention
A faster PyTorch implementation of multi-head self-attention.
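Since several entries in this list implement this module directly, a minimal PyTorch sketch of multi-head self-attention may help orient readers. It is an illustrative baseline, not this repository's code, and d_model/n_heads are assumed example values.

```python
# Minimal multi-head self-attention sketch (illustrative, not this repo's code).
import torch
import torch.nn as nn

class MultiHeadSelfAttention(nn.Module):
    def __init__(self, d_model=512, n_heads=8):  # assumed example sizes
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.qkv = nn.Linear(d_model, 3 * d_model)  # fused Q, K, V projections
        self.out = nn.Linear(d_model, d_model)

    def forward(self, x):  # x: (batch, seq, d_model)
        b, t, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # split each projection into heads: (batch, heads, seq, d_head)
        q, k, v = (z.view(b, t, self.n_heads, self.d_head).transpose(1, 2)
                   for z in (q, k, v))
        scores = q @ k.transpose(-2, -1) / self.d_head ** 0.5  # scaled dot products
        ctx = scores.softmax(dim=-1) @ v                       # weighted values
        ctx = ctx.transpose(1, 2).reshape(b, t, -1)            # merge heads
        return self.out(ctx)
```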
zhaocq-nlp/Attention-Visualization
Visualization for simple attention and Google's multi-head attention.
youngbin-ro/Multi2OIE
Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)
ShaneTian/Att-Induction
Attention-based Induction Networks for Few-Shot Text Classification
JacobHanimann/scDINO
Self-Supervised Vision Transformers for multiplexed imaging datasets
knotgrass/attention
Several types of attention modules written in PyTorch.
engelnico/point-transformer
This is the official repository of the original Point Transformer architecture.
IParraMartin/An-Explanation-Is-All-You-Need
The original Transformer implemented from scratch, with informative comments on each block.
Bruce-Lee-LY/flash_attention_inference
Benchmarks of the C++ interfaces of FlashAttention and FlashAttention-2 in large language model (LLM) inference scenarios.
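As background for what such benchmarks exercise: since PyTorch 2.0 the same class of fused kernels is reachable from Python via torch.nn.functional.scaled_dot_product_attention, which can dispatch to a FlashAttention kernel on supported GPUs. A sketch with illustrative shapes, unrelated to this repo's C++ interface:

```python
# Illustrative call into PyTorch's fused attention (PyTorch >= 2.0), which may
# dispatch to a FlashAttention kernel on supported CUDA GPUs.
import torch
import torch.nn.functional as F

q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)  # (batch, heads, seq, d_head)
k, v = torch.randn_like(q), torch.randn_like(q)
out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
```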
shifop/datagrand_bert
5th-place solution code for the 2019 Datagrand Cup information extraction competition.
jack57lee/Diversify-MHA
EMNLP 2018: Multi-Head Attention with Disagreement Regularization; NAACL 2019: Information Aggregation for Multi-Head Attention with Routing-by-Agreement
Zminghua/SentEncoding
Sentence encoder and training code for Mean-Max AAE
Bruce-Lee-LY/decoding_attention
Decoding Attention optimizes multi-head attention (MHA) with CUDA cores for the decoding stage of LLM inference.
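To make the decoding-stage setting concrete: at each step a single new query attends over the cached keys and values of all previous tokens. A plain PyTorch reference of that computation (the repo itself implements this in CUDA; shapes are assumptions):

```python
# Reference decode-step attention: one query token vs. the KV cache
# (plain PyTorch for clarity; the repo above implements this with CUDA cores).
import torch

def decode_step_attention(q, k_cache, v_cache):
    # q: (batch, heads, 1, d_head); k_cache/v_cache: (batch, heads, seq, d_head)
    scores = q @ k_cache.transpose(-2, -1) / q.shape[-1] ** 0.5  # (batch, heads, 1, seq)
    return scores.softmax(dim=-1) @ v_cache                      # (batch, heads, 1, d_head)
```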
M-e-r-c-u-r-y/pytorch-transformers
Collection of different types of transformers for learning purposes
shreyas-kowshik/nlp4if
Code for the runner-up entry in the English subtask of the Shared Task on Fighting the COVID-19 Infodemic, NLP4IF workshop, NAACL'21.
tranquoctrinh/Image-Captioning-EfficientNet-Transformer
Image captioning with an EfficientNet encoder and a Transformer decoder combined with the attention mechanism.
YigitTurali/HydraViT
A PyTorch implementation of HydraViT, an adaptive multi-branch transformer for multi-label disease classification from chest X-ray images. The repository provides code to train and evaluate the model on the NIH Chest X-ray dataset.
pi-tau/transformer
The Transformer model implemented from scratch using PyTorch. The model uses weight sharing between the embedding layers and the pre-softmax linear layer. Training on the Multi30k machine translation task is shown.
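The weight sharing mentioned here is straightforward to express; a minimal sketch, assuming illustrative vocab_size/d_model values rather than this repo's configuration:

```python
# Tying the token embedding to the pre-softmax projection: both reuse one
# (vocab_size, d_model) matrix (sizes here are assumed, not the repo's).
import torch.nn as nn

vocab_size, d_model = 32000, 512
embedding = nn.Embedding(vocab_size, d_model)
lm_head = nn.Linear(d_model, vocab_size, bias=False)
lm_head.weight = embedding.weight  # shared parameter, updated once per step
```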
tanishqgautam/Transformers
PyTorch implementation of Transformers.
AIMedLab/DeepCE
Code and datasets for the paper "A deep learning framework for high-throughput mechanism-driven phenotype compound screening and its application to COVID-19 drug repurposing", published in Nature Machine Intelligence in 2021.
gazelle93/Attention-Various-Positional-Encoding
This project implements the scaled dot-product attention layer and the multi-head attention layer with various positional encoding methods.
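For reference, the fixed sinusoidal encoding from "Attention Is All You Need" is one of the positional encoding methods such a project typically covers; a minimal sketch, illustrative rather than this repo's code:

```python
# Sinusoidal positional encoding sketch: even dims get sin, odd dims get cos
# (assumes d_model is even).
import math
import torch

def sinusoidal_encoding(seq_len, d_model):
    pos = torch.arange(seq_len, dtype=torch.float32).unsqueeze(1)       # (seq_len, 1)
    freq = torch.exp(torch.arange(0, d_model, 2, dtype=torch.float32)
                     * (-math.log(10000.0) / d_model))                  # (d_model/2,)
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(pos * freq)
    pe[:, 1::2] = torch.cos(pos * freq)
    return pe                                                           # (seq_len, d_model)
```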
dev-geof/final-state-transformer
A machine learning development toolkit built on Transformer encoder architectures, tailored for high-energy physics and particle-collision event analysis.
liaoyanqing666/transformer_pytorch
A complete implementation of the original Transformer.
navreeetkaur/learn-to-pay-attention
TensorFlow implementation of AlexNet with a multi-headed attention mechanism.
SpydazWebAI-NLP/BasicNeuralNetWork2023
A basic multi-layered neural network with attention masking features.
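Assuming "attention masking" here means the usual causal mask (an assumption; the repo may mean something broader), the core idea fits in a few lines:

```python
# Causal attention masking sketch: future positions are set to -inf so the
# softmax assigns them zero weight (assumed meaning of "masking features").
import torch

seq_len = 5
scores = torch.randn(seq_len, seq_len)                 # raw attention logits
mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)
attn = scores.masked_fill(mask, float("-inf")).softmax(dim=-1)
```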
young-zonglin/yangzl-deep-text-matching
Text matching using several deep models.
sushantkumar23/nano-gpt
A simple character-level Transformer.
tate8/translator
Transformer translator website with multithreaded web server in Rust
TmohamedashrafT/vision-transformer-implementation
This repository contains code implementing the Vision Transformer (ViT) model for image classification.
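The distinctive ViT step is turning an image into a token sequence; a common sketch uses a strided convolution to embed non-overlapping patches (patch_size/d_model are assumed example values, not this repo's configuration):

```python
# ViT-style patch embedding sketch: a stride-p convolution cuts the image into
# non-overlapping p x p patches and projects each to a d_model-dim token.
import torch
import torch.nn as nn

patch_size, d_model = 16, 768                         # assumed example values
patch_embed = nn.Conv2d(3, d_model, kernel_size=patch_size, stride=patch_size)
img = torch.randn(1, 3, 224, 224)
tokens = patch_embed(img).flatten(2).transpose(1, 2)  # (1, 196, 768)
```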
sajith-rahim/transformer-classifier
A Transformer classifier implemented from scratch.