zhangqingwu

Pinned Repositories

SEED
Official implementation of SEED-LLaMA (ICLR 2024).
Language: Python 589 15 5032
Bunny
A family of lightweight multimodal models.
Language: Python 966 20 12872
Emu3
Next-Token Prediction is All You Need
Language: Python 1.9k 32 5477
1d-tokenizer
This repo contains the code for 1D tokenizer and generator
Language: Jupyter Notebook 613 13 5329
VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language: Jupyter Notebook 6.5k 121 107429
OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Language: Python 289 12 97
sglang
SGLang is a fast serving framework for large language models and vision language models.
Language: Python 6.8k 63 805618
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Python 0 0 00