Pinned Repositories
Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
DiffusionRet
[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model
shikra
MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Awesome-Remote-Sensing-Multimodal-Large-Language-Model
Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey
Mono3DVG
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024
PE-RSITR
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023
RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
SkyEyeGPT
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
ZhanYang-nwpu
ZhanYang-nwpu's Repositories
ZhanYang-nwpu/Awesome-Remote-Sensing-Multimodal-Large-Language-Model
Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey
ZhanYang-nwpu/RSVG-pytorch
RSVG: Exploring Data and Model for Visual Grounding on Remote Sensing Data, 2022
ZhanYang-nwpu/SkyEyeGPT
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model
ZhanYang-nwpu/Mono3DVG
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024
ZhanYang-nwpu/PE-RSITR
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023
ZhanYang-nwpu/ZhanYang-nwpu