transformer-architecture

There are 218 repositories under transformer-architecture topic.

  • Plachtaa/VALL-E-X

    An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

    Language:Python7.3k82148732
  • cmhungsteve/Awesome-Transformer-Attention

    An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

  • tairov/llama2.mojo

    Inference Llama 2 in one file of pure 🔥

    Language:Mojo2k2744139
  • spago

    nlpodyssey/spago

    Self-contained Machine Learning and Natural Language Processing library in Go

    Language:Go1.7k395686
  • awslabs/sockeye

    Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch

    Language:Python1.2k51311327
  • Ma-Lab-Berkeley/CRATE

    Code for CRATE (Coding RAte reduction TransformEr).

    Language:Python1.1k201782
  • berniwal/swin-transformer-pytorch

    Implementation of the Swin Transformer in PyTorch.

    Language:Python7491025125
  • joeynmt

    joeynmt/joeynmt

    Minimalist NMT for educational purposes

    Language:Python6611695213
  • fastnlp/CPT

    CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

    Language:Python47357973
  • cuiziteng/Illumination-Adaptive-Transformer

    [BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low light enhancement, 0.004 seconds try this for pre-processing.

    Language:Python43657244
  • google-research/maxvit

    [ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...

    Language:Jupyter Notebook42192028
  • kyegomez/MultiModalMamba

    A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.

    Language:Python4118321
  • wgcban/ChangeFormer

    [IGARSS'22]: A Transformer-Based Siamese Network for Change Detection

    Language:Python39339854
  • abdur75648/Deep-Learning-Specialization-Coursera

    This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.

    Language:Jupyter Notebook37754332
  • wjf5203/SeqFormer

    SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)

    Language:Python33972431
  • linwhitehat/ET-BERT

    The repository of ET-BERT, a network traffic classification model on encrypted traffic. The work has been accepted as The Web Conference (WWW) 2022 accepted paper.

    Language:Python29548270
  • PengBoXiangShang/multigraph_transformer

    IEEE TNNLS 2021, transformer, multi-graph transformer, graph, graph classification, sketch recognition, sketch classification, free-hand sketch, official code of the paper "Multi-Graph Transformer for Free-Hand Sketch Recognition"

    Language:Python2927532
  • ZixuanKe/PyContinual

    PyContinual (An Easy and Extendible Framework for Continual Learning)

    Language:Python28272361
  • UIC-Liu-Lab/ContinualLM

    An Extensible Continual Learning Framework Focused on Language Models (LMs)

    Language:Python222101016
  • ernie

    labteral/ernie

    Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.

    Language:Python19971429
  • zhongkaifu/Seq2SeqSharp

    Seq2SeqSharp is a tensor based fast & flexible deep neural network framework written by .NET (C#). It has many highlighted features, such as automatic differentiation, different network types (Transformer, LSTM, BiLSTM and so on), multi-GPUs supported, cross-platforms (Windows, Linux, x86, x64, ARM), multimodal model for text and images and so on.

    Language:C#193235739
  • prakhar21/TextAugmentation-GPT2

    Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.

    Language:Python18771043
  • sgrvinod/a-PyTorch-Tutorial-to-Transformers

    Attention Is All You Need | a PyTorch Tutorial to Transformers

    Language:Python1836533
  • VSainteuf/pytorch-psetae

    PyTorch implementation of the model presented in "Satellite Image Time Series Classification with Pixel-Set Encoders and Temporal Self-Attention"

    Language:Python16932234
  • hkproj/transformer-from-scratch-notes

    Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)

  • miccaiif/TransMEF

    Official PyTorch implementation of our AAAI22 paper: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework via Self-Supervised Multi-Task Learning.

    Language:Python14872016
  • jcwang123/BA-Transformer

    [MICCAI 2021] Boundary-aware Transformers for Skin Lesion Segmentation

    Language:Python11511521
  • quanghuy0497/Transformers4Vision

    A summarization of Transformer-based architectures for CV tasks, including image classification, object detection, segmentation, and Few-shot Learning. Keep updated frequently.

  • ra1ph2/Vision-Transformer

    Implementation of Vision Transformer from scratch and performance compared to standard CNNs (ResNets) and pre-trained ViT on CIFAR10 and CIFAR100.

    Language:Jupyter Notebook92119
  • vilari-mickopf/mmwave-gesture-recognition

    Basic Gesture Recognition Using mmWave Sensor - TI AWR1642

    Language:Python8841118
  • LongRoPE

    jshuadvd/LongRoPE

    Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper

    Language:Python82528
  • kyegomez/Algorithm-Of-Thoughts

    My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"

    Language:Python772415
  • jet-universe/particle_transformer

    Official implementation of "Particle Transformer for Jet Tagging".

    Language:Python735244
  • shamim-hussain/egt_pytorch

    Edge-Augmented Graph Transformer

    Language:Python68459
  • UARK-AICV/VLTinT

    [AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

    Language:Jupyter Notebook644146
  • szq0214/SReT

    Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"

    Language:Python626311