Pinned Repositories
3D-TransUNet
This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"
beckschen.github.io
Homepage of Jieneng Chen
LLaVolta
Efficient Multi-modal Models via Stage-wise Visual Context Compression
open_clip
An open source implementation of CLIP.
TransMix
[CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.
TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
pytorch-image-models
The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts and pretrained weights: ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
TransFG
This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).
Beckschen's Repositories
Beckschen/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
Beckschen/3D-TransUNet
This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"
Beckschen/ViTamin
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
Beckschen/TransMix
[CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.
Beckschen/LLaVolta
Efficient Multi-modal Models via Stage-wise Visual Context Compression
Beckschen/beckschen.github.io
Homepage of Jieneng Chen
Beckschen/open_clip
An open source implementation of CLIP.
Beckschen/pytorch-image-models
PyTorch image models, scripts, and pretrained weights: ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more