lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Python · MIT
Issues
- 0
Swin UNet
#312 opened by sibi-venti
- 0
Validation accuracy higher than training accuracy
#311 opened by yoder460
- 0
Why Remove PreNorm?
#309 opened by tonyyunyang
- 0
Patch Embedding Design Choice?
#310 opened by tonyyunyang
- 2
Cuda memory for 3D VIT
#300 opened by JesseZZZZZ
- 0
Request for Pre-trained Weights for Vit
#308 opened by ZSLsherly
- 0
Whether to include pre-trained models
#307 opened by KawaiiAsh
- 3
- 2
- 1
PyPi page markdown render
#302 opened by soumya1729
- 0
A question with ViT 3d
#298 opened by JesseZZZZZ
- 2
how to train
#288 opened by lingxitong
- 4
Add implementation of LongVit
#297 opened by jpfeil
- 0
- 0
Multi-target Regression Question
#295 opened by stethemJ
- 0
can we use CvT model for segmentation?
#294 opened by HawkingRadiation42
- 0
Masking attention with batches
#293 opened by ashrafflh
- 0
How to use torchvision.models.feature_extraction.create_feature_extractor() with vit_pytorch?
#254 opened by ArturasDruteika
- 1
Question regarding 1d fft use
#292 opened by chengengliu
- 0
- 1
Questions about distill_loss
#289 opened by haoren55555
- 4
Layernorm in Cross attention
#287 opened by turtleman99
- 2
CvT with 1 channel input data
#286 opened by tranlg99
- 1
- 3
Not correctly understanding the Multi Head Attention part of the ViT implementation...
#282 opened by JavierUrenaPhDProjects
- 1
- 1
- 1
vit_pytorch -> cross_vit.py(mistake)
#279 opened by RufusRubin
- 4
structural 3D ViT
#277 opened by aperiamegh
- 1
This ViT implementation as generative network
#276 opened by MrCorsair3
- 0
TVM compilation failed on SimpleViT
#275 opened by yangxin0926
- 4
Dimension issues in Masked Patch Prediction
#269 opened by KananVyas
- 0
MAE Training
#271 opened by mw9385
- 0
Is it possible to use the "Accessing Attention" of the vit-pytorch on the timm models?
#270 opened by Shima-shoki
- 7
ViT for regression task such as Real Estate Price Prediction or Stock Exchange Datasets, any regression dataset.
#259 opened by saifhassan
- 2
- 0
When running python train_cifar10.py, RuntimeError: An attempt has been made to start a new process before the current process has finished its bootstrapping phase.
#267 opened by laserljy
- 2
ViVit pos encoding
#266 opened by eyalmazuz
- 2
PyTorch 2.0 support
#262 opened by kxzxvbk
- 1
Integrate Aim - an open-source experiment tracker
#263 opened by tatyusha
- 2
- 0
Multi-head attention part on ViT
#261 opened by andreYoo
- 1
Small Typo in CCT description
#258 opened by DSARichard
- 0
Using SimpleVit to estimate odometry
#256 opened by Deadrosas
- 0
Apply Tanh activation function to ViT - MLP Head
#255 opened by joeycouse
- 7
[Feature Request] ViTDet
#252 opened by austinmw
- 1
Training a VIT from pre-trained patches embeddings
#251 opened by AdrianBZG
- 0
add an interpolate_embeddings helper function
#249 opened by DanTaranis
- 3
How to use mask in ViT
#246 opened by Ma-Zijing
- 1
The problem of reprinting vivit
#248 opened by kuangxiaoye