Pinned Repositories
Emu3
Next-Token Prediction is All You Need
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
MamTrans
Official code release for State Space Models Meet Transformers for Hyperspectral Image Classification
PPPPPsanG's Repositories
PPPPPsanG/MamTrans
Official code release for State Space Models Meet Transformers for Hyperspectral Image Classification