Mike-YANG-11's Stars
zhengli97/PromptKD
[CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"
ShuweiShao/AF-SfMLearner
[MedIA2022 & ICRA2021] Self-Supervised Monocular Depth and Ego-Motion Estimation in Endoscopy: Appearance Flow to the Rescue
EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
HSG-AIML/GDA
Code repository for "Parameter Efficient Self-supervised Geospatial Domain Adaptation", CVPR 2024
samar-khanna/ExPLoRA
Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"
FreedomIntelligence/HuatuoGPT-Vision
Medical Multimodal LLMs
tntek/source-free-domain-adaptation
CapsuleEndoscope/EndoSLAM
EndoSLAM Dataset and an Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearner
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
LeapLabTHU/EfficientTrain
1.5–3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
naver/unic
PyTorch code and pretrained weights for the UNIC models.
ESandML/SSL4GIE
Official code repository for: A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
tianyu0207/weakly-polyp
[MICCAI'22] Contrastive Transformer-based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection.
bfshi/scaling_on_scales
When do we not need larger vision models?
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites
Malitha123/awesome-video-self-supervised-learning
A curated list of awesome self-supervised learning methods in videos
facebookresearch/sscd-copy-detection
Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).
zhaozh10/Awesome-CLIP-in-Medical-Imaging
A Survey on CLIP in Medical Imaging
facebookresearch/maws
Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496
WeixiongLin/PMC-CLIP
The official codes for "PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents"
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
NVlabs/FoundationPose
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
jhairgallardo/awesome-continual-self-supervised-learning
List of papers that combine self-supervision and continual learning
uncbiag/Awesome-Foundation-Models
A curated list of foundation models for vision and language tasks
yeerwen/MedCoSS
CVPR 2024 (Highlight)
JiazuoYu/MoE-Adapters4CL
Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024