lucasjinreal/LLaVA-Magvit2

LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder and unifying image understanding and generation.

Python

Pinned issues

Progress Status

#2 opened 9 months ago by lucasjinreal

Open, 12 comments

Issues

Progress Status
#2 opened 9 months ago by lucasjinreal, 12 comments

Have a tech report or paper?
#1 opened 10 months ago by daixiangzi, 3 comments