duan-song's Stars
jianzongwu/Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
wangf3014/SCLIP
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
tue-mps/benchmark-vfm-ss
mlfoundations/open_clip
An open source implementation of CLIP.
AntonioTepsich/Convolutional-KANs
This project extends the Kolmogorov-Arnold Network (KAN) architecture to convolutional layers, replacing the classic linear transformation of the convolution with learnable nonlinear activations at each pixel.
zh460045050/DA-WSOL_CVPR2022
Official implementation of the paper "Weakly Supervised Object Localization as Domain Adaption"
zcablii/SARDet_100K
[NeurIPS 2024 spotlight] Official implementation of MSFA and release of the SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection
facebookresearch/CutLER
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"
Event-AHU/Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
BradyFU/Awesome-Multimodal-Large-Language-Models
Latest Advances on Multimodal Large Language Models
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
facebookresearch/dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
LeapLabTHU/Agent-Attention
Official repository of Agent Attention (ECCV2024)
shjo-april/DHR
[ECCV 2024] DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation
duan-song/M2DFNet-for-ICIP-2022
javiribera/locating-objects-without-bboxes
PyTorch code for "Locating objects without bounding boxes" - Loss function and trained models
lxtGH/OMG-Seg
OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
goodvision/awesome-image-retrieval
Some good open-source code for image retrieval
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
lukemelas/EfficientNet-PyTorch
A PyTorch implementation of EfficientNet
THU-LYJ-Lab/T3Bench
T3Bench: Benchmarking Current Progress in Text-to-3D Generation
wenhwu/awesome-remote-sensing-change-detection
List of datasets, codes, and contests related to remote sensing change detection
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
macanv/BERT-BiLSTM-CRF-NER
TensorFlow solution for the NER task using a BiLSTM-CRF model with Google BERT fine-tuning, plus private server services
juand-r/entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
amusi/daily-paper-computer-vision
A daily-curated record of papers in computer vision, deep learning, and machine learning
DengPingFan/D3NetBenchmark
Rethinking RGB-D Salient Object Detection: Models, Datasets, and Large-Scale Benchmarks, IEEE TNNLS 2020