jc1888822's Stars
eric-ai-lab/Screen-Point-and-Read
Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"
ZJULiHongxin/AutoGUI
The official implementation of AutoGUI.
showlab/Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
njucckevin/SeeClick
The model, data and code for the visual GUI Agent SeeClick
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
kdwonn/SaG
Official repository of "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision" (ICCV 2023)
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
amusi/AI-Job-Notes
A job-hunting guide for AI algorithm roles (covering preparation strategies, coding-interview guides, referrals, a list of AI companies, and more)
zjh31/CPL
MarkMoHR/Awesome-Referring-Image-Segmentation
📚 A collection of papers about Referring Image Segmentation.
clownrat6/Out-of-Candidate-Rectification
[CVPR 2023] Implementation of out-of-candidate rectification methods
muyangyi/SimSeg
[CVPR 2023] A Simple Framework for Text-Supervised Semantic Segmentation
Jazzcharles/OVSegmentor
[CVPR 2023] OVSegmentor
khanrc/tcl
Official implementation of TCL (CVPR 2023)
Vibashan/Mask-free-OVIS
[CVPR 2023] Official PyTorch codebase for "Open-Vocabulary Instance Segmentation without Manual Mask Annotations"
YuLiu-LY/BO-QSA
The official implementation of "Improving Object-centric Learning with Query Optimization"
linyq2117/CLIP-ES
SooLab/CGFormer
The official PyTorch implementation of the CVPR 2023 paper "Contrastive Grouping with Transformer for Referring Image Segmentation".
fawnliu/TRIS
[ICCV 2023] Official code release of our paper "Referring Image Segmentation Using Text Supervision"
rulixiang/ToCo
[CVPR 2023] Token Contrast for Weakly-Supervised Semantic Segmentation
linhuixiao/CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
codezakh/SIMLA
[ECCV 2022] Single Stream Multi-Level Alignment for Vision Language Pretraining
zjukg/DUET
[AAAI 2023] DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
lezhang7/Enhance-FineGrained
[CVPR 2024] Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Fine-grained Understanding
THU-MIG/Consolidator
Official implementation for ICLR 2023 paper Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation
modestyachts/ImageNetV2_pytorch
ImageNetV2 Pytorch Dataset
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
X-PLUG/mPLUG-2
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
zhangxinsong-nlp/XFM
source code for XFM, a general foundation model for language, vision, and vision-language understanding