lijiachen8863's Stars
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
facebookresearch/detr
End-to-End Object Detection with Transformers
ZhangGongjie/IMFA
Seonghoon-Yu/Zero-shot-RIS
[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"
Luodian/RelateAnything
Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
yz93/LAVT-RIS
lifeGWT/LAVT-pytorch
LAVT: Language-Aware Vision Transformer for Referring Image Segmentation
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
yangli18/VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
jadore801120/attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
fundamentalvision/Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
dair-ai/ML-Papers-Explained
Explanations of key concepts in ML
NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
wjf5203/SeqFormer
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
luo3300612/Visualizer
Assistant tools for attention visualization in deep learning
ShawnBIT/UNet-family
Papers and implementations of UNet-related models.
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
HRNet/HRNet-Semantic-Segmentation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
TencentARC/MCQ
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
TingsongYu/python-small-examples
Say goodbye to dull tutorials: a collection of small, practical Python examples. More quality Python tutorials at Python中文网 http://www.zglg.work
TingsongYu/PyTorch_Tutorial
Companion code for the book "A Practical Tutorial on PyTorch Model Training" (《Pytorch模型训练实用教程》)
shanglianlm0525/PyTorch-Networks
PyTorch implementations of CNN networks
kuangliu/pytorch-cifar
95.47% on CIFAR10 with PyTorch
52CV/CVPR-2022-Papers
JackKuo666/Data_Structure_with_Python
My notes and code from studying "Data Structures with Python" (《基于Python的数据结构》)
LeeJunHyun/Image_Segmentation
PyTorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.