hunto's Stars
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
facebookresearch/deit
Official DeiT repository
huggingface/swift-coreml-diffusers
Swift app demonstrating Core ML Stable Diffusion
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
RayeRen/acad-homepage.github.io
AcadHomepage: A Modern and Responsive Academic Personal Homepage
HuangJunJie2017/BEVDet
Code base of the BEVDet series .
chaytonmin/Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
YoungXIAO13/ObjectPoseEstimationSummary
Resources (papers, datasets, rendering methods) in the domain of object pose estimation.
NVlabs/denoising-diffusion-gan
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804
4DVLab/Vision-Centric-BEV-Perception
Vision-Centric BEV Perception: A Survey
raoyongming/DynamicViT
[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object Detection"
Nota-NetsPresso/BK-SDM
A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]
automl/awesome-transformer-search
A curated list of awesome resources combining Transformers with Neural Architecture Search
winycg/CIRKD
[CVPR-2022] Official implementations of CIRKD: Cross-Image Relational Knowledge Distillation for Semantic Segmentation and implementations on Cityscapes, ADE20K, COCO-Stuff., Pascal VOC and CamVid.
LayneH/GreenMIM
[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.
hunto/DIST_KD
Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022
hunto/LightViT
Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"
jd-opensource/TeD-Q
TeD-Q (Tensor-network enhanced Distributed Quantum) is a tensor network enhanced distributed hybrid quantum machine learning framework.
mingkai-zheng/GENIUS
Can GPT-4 Perform Neural Architecture Search?
hunto/DiffKD
Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023
hunto/MasKD
Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.
LayneH/LEWEL
[CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'
Gumpest/AvatarKD
[MM'22] Official implementation of paper "Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty".
mingkai-zheng/CoNe
xiusu/BCNetV2