JMcarrot's Stars
ai-dawang/PlugNPlay-Modules
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
PKINet/PKINet
Official implementation of CVPR2024 Paper "Poly Kernel Inception Network for Remote Sensing Detection".
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
Ruixxxx/Awesome-Vision-Mamba-Models
[Official Repo] A Survey on Vision Mamba: Models, Applications and Challenges
datawhalechina/tiny-universe
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
henghuiding/Vision-Language-Transformer
[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation
awaisrauf/Awesome-CV-Foundational-Models
SkalskiP/awesome-foundation-and-multimodal-models
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
xiuqhou/Relation-DETR
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
ZhanYang-nwpu/Awesome-Remote-Sensing-Multimodal-Large-Language-Model
Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey
zytx121/Awesome-VLGFM
A Survey on Vision-Language Geo-Foundation Models (VLGFMs)
xg416/DATUM
Official repo for the Deep Atmospheric TUrbulence Mitigation network
vlislab22/AIAA-5027
linhuixiao/CLIP-VG
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
impiga/Plain-DETR
[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
yuyongcan/Benchmark-TTA
kkakkkka/ETRIS
[ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
dongzhang89/FPT
Implementation for paper: Feature Pyramid Transformer
QY1994-0919/CFPNet
Centralized Feature Pyramid for Object Detection
duzw9311/CFPT
Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images
wjiazheng/SwinPA-Net
SwinPA-Net: Swin Transformer-Based Multiscale Feature Pyramid Aggregation Network for Medical Image Segmentation
amazon-science/polygon-transformer
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list on Vision Transformers/Attention, including papers, code, and related websites
MenghaoGuo/Awesome-Vision-Attentions
Summary of papers on visual attention. Related code will be gradually released, based on Jittor.
TheShadow29/awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
linhuixiao/Awesome-Visual-Grounding
A Survey on Visual Grounding
open-mmlab/mmcv
OpenMMLab Computer Vision Foundation
52CV/CVPR-2024-Papers
jingyi0000/VLM_survey
Collection of AWESOME vision-language models for vision tasks