Pinned Repositories
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
WeakTr
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Yingyue-L's Repositories
Yingyue-L/Mamba-LLaVA
Yingyue-L/deeplabv1-resnet38
Yingyue-L/lab-info-collection
Yingyue-L/LLaVA-MobileLLaMA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Yingyue-L/mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
Yingyue-L/OneFormer
OneFormer: One Transformer to Rule Universal Image Segmentation, arXiv 2022 / CVPR 2023
Yingyue-L/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.