vit

There are 332 repositories under the vit topic.

  • lukas-blecher/LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

    Language: Python
  • cmhungsteve/Awesome-Transformer-Attention

    An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

  • towhee-io/towhee

    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

    Language: Python
  • hila-chefer/Transformer-Explainability

    [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.

    Language: Jupyter Notebook
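The paper above propagates relevance through a Transformer's attention layers. As context, here is a minimal NumPy sketch of the simpler attention-rollout baseline (Abnar & Zuidema, 2020) that such interpretability methods improve on; the function name and toy shapes are illustrative, not code from the repository:

```python
import numpy as np

def attention_rollout(attn_layers):
    """Roll head-averaged attention maps across layers.

    attn_layers: list of (tokens, tokens) row-stochastic attention matrices.
    Mixes in an identity matrix to account for the residual connection,
    renormalizes the rows, and multiplies the layers together.
    """
    tokens = attn_layers[0].shape[0]
    rollout = np.eye(tokens)
    for attn in attn_layers:
        attn = 0.5 * attn + 0.5 * np.eye(tokens)        # residual path
        attn = attn / attn.sum(axis=-1, keepdims=True)  # keep rows summing to 1
        rollout = attn @ rollout
    return rollout

# toy example: 3 layers of random attention over 5 tokens (CLS + 4 patches)
rng = np.random.default_rng(0)
layers = [rng.random((5, 5)) for _ in range(3)]
layers = [a / a.sum(axis=-1, keepdims=True) for a in layers]
relevance = attention_rollout(layers)[0, 1:]  # CLS token's relevance per patch
```

Reading off the CLS row of the rolled-out matrix gives one relevance score per image patch, which can be reshaped into a heatmap; the CVPR 2021 method replaces this plain product with gradient-weighted relevance propagation.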
  • open-compass/VLMEvalKit

    Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks

    Language: Python
  • roboflow/inference

    A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

    Language: Python
  • BR-IDL/PaddleViT

    🤖 PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

    Language: Python
  • yitu-opensource/T2T-ViT

    [ICCV 2021] Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

    Language: Jupyter Notebook
  • Yangzhangcst/Transformer-in-Computer-Vision

    A paper list of some recent Transformer-based CV works.

  • sail-sg/Adan

    Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

    Language: Python
  • v-iashin/video_features

    Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

    Language: Python
  • chinhsuanwu/mobilevit-pytorch

    A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

    Language: Python
  • zgcr/SimpleAICV_pytorch_training_examples

    SimpleAICV: PyTorch training and testing examples.

    Language: Python
  • vatz88/FFCSonTheGo

    FFCS course registration made hassle free for VITians. Search courses and visualize the timetable on the go!

    Language: JavaScript
  • gupta-abhay/pytorch-vit

    An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

    Language: Python
  • PaddlePaddle/PASSL

    PASSL includes image self-supervised learning algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, SimSiam, SwAV, BEiT, and MAE, as well as foundational vision models such as Vision Transformer, DeiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.

    Language: Python
  • eeyhsong/EEG-Transformer

    i. A practical application of a Transformer (ViT) to 2-D physiological signal (EEG) classification tasks; it can also be tried on EMG, EOG, ECG, etc. ii. Includes attention over the spatial dimension (channel attention) and the temporal dimension. iii. Common spatial pattern (CSP), an efficient feature-enhancement method, implemented in Python.

    Language: Python
  • megvii-research/RevCol

    Official Code of Paper "Reversible Column Networks" "RevColv2"

    Language: Python
  • qanastek/HugsVision

    HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision

    Language: Jupyter Notebook
  • kyegomez/NaViT

    My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

    Language: Python
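NaViT's "Patch n' Pack" idea packs patch tokens from images of different resolutions into shared fixed-length sequences instead of padding each image separately. Below is a hypothetical NumPy sketch of that greedy packing with a mask marking real versus padding tokens; the function name and shapes are invented for illustration, and a real implementation also needs per-image position ids so attention does not cross image boundaries:

```python
import numpy as np

def pack_sequences(seqs, max_len, dim):
    """Greedily pack variable-length token sequences into fixed-length rows.

    seqs: list of (n_i, dim) arrays of patch tokens from different images.
    Returns (rows, max_len, dim) packed tokens and a (rows, max_len) 0/1 mask
    distinguishing real tokens from padding.
    """
    rows, masks = [], []
    cur, mask, used = np.zeros((max_len, dim)), np.zeros(max_len, dtype=int), 0
    for seq in seqs:
        n = len(seq)
        if used + n > max_len:  # start a fresh row when this one cannot fit seq
            rows.append(cur)
            masks.append(mask)
            cur, mask, used = np.zeros((max_len, dim)), np.zeros(max_len, dtype=int), 0
        cur[used:used + n] = seq
        mask[used:used + n] = 1
        used += n
    rows.append(cur)
    masks.append(mask)
    return np.stack(rows), np.stack(masks)

# three "images" yielding 6, 3, and 5 patch tokens of width 4
tokens, mask = pack_sequences(
    [np.ones((6, 4)), np.ones((3, 4)), np.ones((5, 4))], max_len=10, dim=4)
```

Here the first two images share one row (6 + 3 = 9 tokens plus 1 pad) and the third starts a new row, so no per-image padding to a common resolution is needed.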
  • xmindflow/Awesome-Transformer-in-Medical-Imaging

    [MedIA Journal] An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

  • SkyworkAI/MoH

    MoH: Multi-Head Attention as Mixture-of-Head Attention

    Language: Python
  • zwcolin/EEG-Transformer

    A ViT based transformer applied on multi-channel time-series EEG data for motor imagery classification

    Language: Python
  • implus/mae_segmentation

    Reproduction of semantic segmentation using a masked autoencoder (MAE).

    Language: Python
  • yaoxiaoyuan/mimix

    Mimix: A Text Generation Tool and Pretrained Chinese Models

    Language: Python
  • PaddlePaddle/PLSC

    Paddle large-scale classification tools; supports ArcFace, CosFace, PartialFC, and data parallel + model parallel training. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, and CAE.

    Language: Python
  • kyegomez/Vit-RGTS

    Open source implementation of "Vision Transformers Need Registers"

    Language: Python
  • hunto/LightViT

    Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"

    Language: Python
  • s-chh/PyTorch-Scratch-Vision-Transformer-ViT

    Simple and easy to understand PyTorch implementation of Vision Transformer (ViT) from scratch with detailed steps. Tested on small datasets: MNIST, FashionMNIST, SVHN, CIFAR10, and CIFAR100.

    Language: Python
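The heart of such a from-scratch ViT is the patch embedding: cut the image into 16×16 patches, flatten each one, and project it linearly. A framework-free NumPy sketch of that step (the repository itself uses PyTorch; names here are illustrative, and a random projection stands in for the learned linear layer):

```python
import numpy as np

def patch_embed(img, patch=16, dim=64, seed=0):
    """Turn an (H, W, C) image into a (num_patches, dim) token sequence."""
    h, w, c = img.shape
    assert h % patch == 0 and w % patch == 0, "image must tile evenly into patches"
    # split into a (H/p, p, W/p, p, C) grid, then flatten each patch to one vector
    patches = img.reshape(h // patch, patch, w // patch, patch, c)
    patches = patches.transpose(0, 2, 1, 3, 4).reshape(-1, patch * patch * c)
    proj = np.random.default_rng(seed).standard_normal((patch * patch * c, dim))
    return patches @ proj / np.sqrt(patch * patch * c)

tokens = patch_embed(np.zeros((32, 32, 3)))  # 2x2 grid of 16x16 patches -> (4, 64)
```

In a full model these tokens get a prepended CLS token and learned position embeddings before entering the Transformer encoder.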
  • jaehyunnn/ViTPose_pytorch

    An unofficial implementation of ViTPose [Y. Xu et al., 2022]

    Language: Jupyter Notebook
  • vitjs/vit

    🚀 React application framework inspired by UmiJS

    Language: TypeScript
  • kamalkraj/Vision-Transformer

    Vision Transformer using TensorFlow 2.0

    Language: Python
  • DefTruth/Awesome-SD-Inference

    📖 A small curated list of awesome SD/DiT/ViT/Diffusion inference with distributed/caching/sampling techniques: DistriFusion, PipeFusion, AsyncDiff, DeepCache, Block Caching, etc.

  • rasbt/pytorch-memory-optim

    This repository contains the code for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.

    Language: Python
  • daniel-code/TubeViT

    An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

    Language: Python
  • zubair-irshad/NeRF-MAE

    [ECCV 2024] PyTorch code for NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

    Language: Python