lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
PythonMIT
Issues
- 5
train navit nest 3d error when backward
#332 opened by HaloTrouvaille - 1
RegionViT - Local token embedding
#330 opened by minhquoc0712 - 1
The Total params: and Params size (MB) of the model printed by summary are different from the bit_base model in timm library. Theoretically, the same settings should be the same. What is the reason?
#329 opened by lucker26 - 1
- 1
Multi-GPU training of NaViT model
#322 opened by b5y - 0
Weight Initialization
#321 opened by simonaay - 0
SimpleViT misleading summary
#320 opened by asusdisciple - 0
- 3
Questions about distill_loss
#289 opened by haoren55555 - 1
Validation accuracy higher than training accuracy
#311 opened by yoder460 - 0
Choice for reduced order model / latent space
#315 opened by ramdhan1989 - 0
[MaxViT] Block/Grid Attention question
#314 opened by sonderlau - 1
Whether to include pre-trained models
#307 opened by KawaiiAsh - 0
Swin UNet
#312 opened by sibi-venti - 0
Why Remove PreNorm?
#309 opened by tonyyunyang - 0
Patch Embedding Design Choice?
#310 opened by tonyyunyang - 2
Cuda memory for 3D VIT
#300 opened by JesseZZZZZ - 0
Request for Pre-trained Weights for Vit
#308 opened by ZSLsherly - 3
- 2
- 1
PyPi page markdown render
#302 opened by soumya1729 - 0
A question with ViT 3d
#298 opened by JesseZZZZZ - 2
how to train
#288 opened by lingxitong - 4
Add implementation of LongVit
#297 opened by jpfeil - 0
- 0
Multi-target Regression Question
#295 opened by stethemJ - 0
can we use CvT model for segmentation?
#294 opened by HawkingRadiation42 - 0
Masking attention with batches
#293 opened by ashrafflh - 1
Question regarding 1d fft use
#292 opened by chengengliu - 0
- 4
Layernorm in Cross attention
#287 opened by turtleman99 - 2
CvT with 1 channel input data
#286 opened by tranlg99 - 1
- 3
Not correctly understanding the Multi Head Attention part of the ViT implementation...
#282 opened by JavierUrenaPhDProjects - 1
- 1
- 1
vit_pytorch -> cross_vit.py(mistake)
#279 opened by RufusRubin - 4
structural 3D ViT
#277 opened by aperiamegh - 1
This ViT implementation as generative network
#276 opened by MrCorsair3 - 0
TVM compilation failed on SimpleViT
#275 opened by yangxin0926 - 4
Dimension issues in Masked Patch Prediction
#269 opened by KananVyas - 0
MAE Training
#271 opened by mw9385 - 0
Is it possible to use the "Accessing Attention" of the vit-pytorch on the timm models?
#270 opened by Shima-shoki - 7
ViT for regression task such as Real Estate Price Prediction or Stock Exchange Datasets, any regression dataset.
#259 opened by saifhassan - 0
When running python train_cifar10.py, RuntimeError: An attempt has been made to start a new process before the current process has finished its bootstrapping phase.
#267 opened by laserljy - 2
ViVit pos encoding
#266 opened by eyalmazuz - 2
PyTorch 2.0 support
#262 opened by kxzxvbk - 1
Integrate Aim - an open-source experiment tracker
#263 opened by tatyusha - 0
Multi-head attention part on ViT
#261 opened by andreYoo - 1
Small Typo in CCT description
#258 opened by DSARichard