qwen-vl

There are 5 repositories under qwen-vl topic.

PaddlePaddle/PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Language:Python292 22 131110
gokayfem/awesome-vlm-architectures
Famous Vision Language Models and Their Architectures
Language:Markdown267 9 218
zjysteven/lmms-finetune
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, qwen-vl, phi3-v etc.
Language:Python115 4 188
reidbarber/webmarker
Mark web pages for use with multimodal large language models
Language:TypeScript10 1 01
autodistill/autodistill-qwen-vl
Qwen-VL base model for use with Autodistill.
Language:Python4 1