Pinned Repositories
VQLoC
(NeurIPS 2023) Open-set visual object query search & localization in long-form videos
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
MamTrans
Official coda release for State Space Models Meet Transformers for Hyperspectral Image Classification
PPPPPsanG's Repositories
PPPPPsanG/MamTrans
Official coda release for State Space Models Meet Transformers for Hyperspectral Image Classification