22109095's Stars
HCPLab-SYSU/Embodied_AI_Paper_List
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
JiazuoYu/Awesome-Prompt-Adapter-Learning-for-Vision-Language-Models
A curated list of prompt/adapter learning methods for vision-language models.
JiazuoYu/PathWeave
Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024
DAMO-NLP-SG/Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
Paranioar/SGRAF
[AAAI2021] The code of "Similarity Reasoning and Filtration for Image-Text Matching"
Paranioar/RCAR
[TIP2023] The code of "Plug-and-Play Regulators for Image-Text Matching"
Paranioar/UniPT
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
Paranioar/Awesome_Image_Text_Retrieval_Benchmark
The Unified Code of Image-Text Retrieval for Further Exploration.
Paranioar/DBL
[TIP2024] The code of "Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching"
Rongtao-Xu/Awesome-LLM-EN
Lionelsy/Conference-Accepted-Paper-List
Accepted-paper lists for selected conferences (including AI, ML, and Robotics)
YicongHong/Thinking-VLN
Ideas and thoughts about the fascinating field of Vision-and-Language Navigation
Paranioar/Awesome_Matching_Pretraining_Transfering
A paper list covering large multi-modality models (perception, generation, unification), parameter-efficient finetuning, vision-language pretraining, and conventional image-text matching, for preliminary insight.
JiazuoYu/MoE-Adapters4CL
Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024
pengsida/learning_research
My personal research experience
cmhungsteve/Awesome-Transformer-Attention
A comprehensive paper list on Vision Transformers/Attention, including papers, code, and related websites
micronDLA/MobileViTv3
22109095/SimOWT
Official implementation of the paper "A Simple Baseline for Open-World Tracking via Self-training"
hkchengrex/Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
statusrank/XCurve
XCurve is an end-to-end PyTorch library for optimizing X-curve metrics in machine learning.
jianghaojun/Awesome-Parameter-Efficient-Transfer-Learning
A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.
longzw1997/Open-GroundingDino
A third-party implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
jiawen-zhu/ViPT
[CVPR23] Visual Prompt Multi-Modal Tracking
MasterBin-IIAU/UNINEXT
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
YangLiu14/detectron2-OWT
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
YangLiu14/Open-World-Tracking
Official code for "Opening up Open World Tracking" (CVPR 2022)