Eli-YiLi's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
open-mmlab/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
whai362/PVT
Official implementation of PVT series
facebookresearch/DomainBed
DomainBed is a suite to test domain generalization algorithms
omerbt/Text2LIVE
Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
huawei-noah/VanillaNet
NVlabs/GroupViT
Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.
YudeWang/SEAM
Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation, CVPR 2020 (Oral)
OpenGVLab/Multi-Modality-Arena
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
xmed-lab/CLIP_Surgery
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
tjddus9597/Proxy-Anchor-CVPR2020
Official PyTorch Implementation of Proxy Anchor Loss for Deep Metric Learning, CVPR 2020
MediaBrain-SJTU/FACT
xmed-lab/CLIPN
ICCV 2023: CLIPN for Zero-Shot OOD Detection: Teaching CLIP to Say No
hyn2028/llm-cxr
Official code for "LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation"
Eli-YiLi/PMM
Pseudo-mask Matters in Weakly-supervised Semantic Segmentation
xmed-lab/URN
AAAI 2022: Uncertainty Estimation via Response Scaling for Pseudo-Mask Noise Mitigation in Weakly-Supervised Semantic Segmentation
aldraus/quilt-llava
Codebase for Quilt-LLaVA
xmed-lab/OEEM
MICCAI 2022: Online Easy Example Mining for Weakly-supervised Gland Segmentation from Histology Images
TencentAILabHealthcare/ConCL
Eli-YiLi/WSSS_MMSeg
Pseudo-mask Matters in Weakly-supervised Semantic Segmentation
xmed-lab/ICLIP
Exploring Visual Interpretability for Contrastive Language-Image Pretraining
hsgkim/ResNetV2
xmed-lab/INC
Few-Shot Lymph Node Metastasis Classification Meets High Performance on Whole Slide Images via the Informative Non-Parametric Classifier