transformer-architecture

There are 218 repositories under transformer-architecture topic.

Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Language:Python7.3k 82 148732
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
4.3k 122 24477
tairov/llama2.mojo
Inference Llama 2 in one file of pure 🔥
Language:Mojo2k 27 44139
nlpodyssey/spago
Self-contained Machine Learning and Natural Language Processing library in Go
Language:Go1.7k 39 5686
awslabs/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
Language:Python1.2k 51 311327
Ma-Lab-Berkeley/CRATE
Code for CRATE (Coding RAte reduction TransformEr).
Language:Python1.1k 20 1782
berniwal/swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
Language:Python749 10 25125
joeynmt/joeynmt
Minimalist NMT for educational purposes
Language:Python661 16 95213
fastnlp/CPT
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
Language:Python473 5 7973
cuiziteng/Illumination-Adaptive-Transformer
[BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low light enhancement, 0.004 seconds try this for pre-processing.
Language:Python436 5 7244
google-research/maxvit
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
Language:Jupyter Notebook421 9 2028
kyegomez/MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
Language:Python411 8 321
wgcban/ChangeFormer
[IGARSS'22]: A Transformer-Based Siamese Network for Change Detection
Language:Python393 3 9854
abdur75648/Deep-Learning-Specialization-Coursera
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.
Language:Jupyter Notebook377 5 4332
wjf5203/SeqFormer
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
Language:Python339 7 2431
linwhitehat/ET-BERT
The repository of ET-BERT, a network traffic classification model on encrypted traffic. The work has been accepted as The Web Conference (WWW) 2022 accepted paper.
Language:Python295 4 8270
PengBoXiangShang/multigraph_transformer
IEEE TNNLS 2021, transformer, multi-graph transformer, graph, graph classification, sketch recognition, sketch classification, free-hand sketch, official code of the paper "Multi-Graph Transformer for Free-Hand Sketch Recognition"
Language:Python292 7 532
ZixuanKe/PyContinual
PyContinual (An Easy and Extendible Framework for Continual Learning)
Language:Python282 7 2361
UIC-Liu-Lab/ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
Language:Python222 10 1016
labteral/ernie
Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
Language:Python199 7 1429
zhongkaifu/Seq2SeqSharp
Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.
Language:C#193 23 5739
prakhar21/TextAugmentation-GPT2
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
Language:Python187 7 1043
sgrvinod/a-PyTorch-Tutorial-to-Transformers
Attention Is All You Need | a PyTorch Tutorial to Transformers
Language:Python183 6 533
VSainteuf/pytorch-psetae
PyTorch implementation of the model presented in "Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention"
Language:Python169 3 2234
hkproj/transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
167 3 139
miccaiif/TransMEF
Official PyTorch implementation of our AAAI22 paper: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework via Self-Supervised Multi-Task Learning.
Language:Python148 7 2016
jcwang123/BA-Transformer
[MICCAI 2021] Boundary-aware Transformers for Skin Lesion Segmentation
Language:Python115 1 1521
quanghuy0497/Transformers4Vision
A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-shot Learning. Keep updated frequently.
102 2 018
ra1ph2/Vision-Transformer
Implementation of Vision Transformer from scratch and performance compared to standard CNNs (ResNets) and pre-trained ViT on CIFAR10 and CIFAR100.
Language:Jupyter Notebook92 1 19
vilari-mickopf/mmwave-gesture-recognition
Basic Gesture Recognition Using mmWave Sensor - TI AWR1642
Language:Python88 4 1118
jshuadvd/LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
Language:Python82 5 28
kyegomez/Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
Language:Python77 2 415
jet-universe/particle_transformer
Official implementation of "Particle Transformer for Jet Tagging".
Language:Python73 5 244
shamim-hussain/egt_pytorch
Edge-Augmented Graph Transformer
Language:Python68 4 59
UARK-AICV/VLTinT
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Language:Jupyter Notebook64 4 146
szq0214/SReT
Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"
Language:Python62 6 311

transformer-architecture

Plachtaa/VALL-E-X

cmhungsteve/Awesome-Transformer-Attention

tairov/llama2.mojo

nlpodyssey/spago

awslabs/sockeye

Ma-Lab-Berkeley/CRATE

berniwal/swin-transformer-pytorch

joeynmt/joeynmt

fastnlp/CPT

cuiziteng/Illumination-Adaptive-Transformer

google-research/maxvit

kyegomez/MultiModalMamba

wgcban/ChangeFormer

abdur75648/Deep-Learning-Specialization-Coursera

wjf5203/SeqFormer

linwhitehat/ET-BERT

PengBoXiangShang/multigraph_transformer

ZixuanKe/PyContinual

UIC-Liu-Lab/ContinualLM

labteral/ernie

zhongkaifu/Seq2SeqSharp

prakhar21/TextAugmentation-GPT2

sgrvinod/a-PyTorch-Tutorial-to-Transformers

VSainteuf/pytorch-psetae

hkproj/transformer-from-scratch-notes

miccaiif/TransMEF

jcwang123/BA-Transformer

quanghuy0497/Transformers4Vision

ra1ph2/Vision-Transformer

vilari-mickopf/mmwave-gesture-recognition

jshuadvd/LongRoPE

kyegomez/Algorithm-Of-Thoughts

jet-universe/particle_transformer

shamim-hussain/egt_pytorch

UARK-AICV/VLTinT

szq0214/SReT