/LLaVA-Magvit2

LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.

Primary LanguagePython

Pinned issues

Progress Status

#2 opened by lucasjinreal

Open12

Issues