Pinned Repositories
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
FasterViT
Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
GCVit
Official PyTorch implementation of Global Context Vision Transformers
Neighborhood-Attention-Transformer
[Preprint] Neighborhood Attention Transformer
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
VAR
[NeurIPS 2024 Oral][GPT beats diffusionš„] [scaling laws in visual generationš] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
OneFormer
[CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation
sd-webui-cads
Greatly increase the diversity of your generated images in Automatic1111 WebUI through Condition-Annealed Sampling.
achen46's Repositories
achen46/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
achen46/FasterViT
Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
achen46/GCVit
Official PyTorch implementation of Global Context Vision Transformers
achen46/Neighborhood-Attention-Transformer
[Preprint] Neighborhood Attention Transformer