Lauch1ng's Stars
WenjunHuang94/ML-Mamba
ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2
CircleRadon/TokenPacker
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".
ShiArthur03/ShiArthur03
AILab-CVC/UniRepLKNet
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Lauch1ng/LKRobust
TRI-ML/vlm-evaluation
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
MzeroMiko/VMamba
VMamba: Visual State Space Models; code is based on Mamba
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
kyegomez/MultiModalMamba
A novel implementation fusing ViT with Mamba into a fast, agile, and high-performance multi-modal model. Powered by Zeta, the simplest AI framework ever.
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
A-LinCui/Adversarial_Patch_Attack
PyTorch implementation of Adversarial Patch on ImageNet (arXiv: https://arxiv.org/abs/1712.09665)
Muzammal-Naseer/IPViT
Official repository for "Intriguing Properties of Vision Transformers" (NeurIPS 2021 Spotlight)
facebookresearch/ConvNeXt-V2
Code release for ConvNeXt V2 model
jianlong-yuan/UniNeXt
AbrahamYabo/SdAE
openai/guided-diffusion
Visual-Attention-Network/VAN-Classification