YuzaChongyi

YuzaChongyi's Stars

arcee-ai/mergekit
Tools for merging pretrained large language models.
Language: Python | 4.9k | 455
MBZUAI-LLM/web2code
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Language: Python | 676
OpenGVLab/OmniCorpus
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Language: Python | 2817
HKUST-LongGroup/CoMM
Official repository for CoMM Dataset
Language: Python | 27
mlfoundations/MINT-1T
MINT-1T: A one trillion token multimodal interleaved dataset.
78120
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python | 10.8k | 2.4k
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language: Python | 144k | 27.1k
open-compass/VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks
Language: Python | 1.5k | 206
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language: Jupyter Notebook | 7.2k | 459
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language: Python | 12.8k | 897
dalinvip/Awesome-ChatGPT
A curated collection of ChatGPT learning resources, continuously updated...
4.1k | 382
tencent-ailab/IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Language: Jupyter Notebook | 5.4k | 344
THUDM/CogVLM
A state-of-the-art-level open visual language model | Multimodal pretrained model
Language: Python | 6.2k | 420
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language: Python | 14.7k | 1.2k
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
Language: Python | 4.1k | 298
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
Language: Python | 66757
youngyangyang04/leetcode-master
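《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀
Language: Shell | 52.6k | 11.6k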
ddPn08/Radiata
Stable diffusion webui based on diffusers.
Language: Python | 98269
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
Language: Python | 3k | 236
OpenBMB/VisCPM
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models
Language: Python | 1.1k | 92
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models
13k | 832
OpenBMB/CPM-Bee
A Chinese-English bilingual foundation large model with ten billion parameters
Language: Python | 2.7k | 215
HenryHZY/Awesome-Multimodal-LLM
Research Trends in LLM-guided Multimodal Learning.
35516
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python | 20.7k | 2.3k
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Language: Python | 25.5k | 2.9k
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
Language: Python | 2.5k | 176
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Language: Python | 87982
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook | 10k | 976
microsoft/torchscale
Foundation Architecture for (M)LLMs
Language: Python | 3k | 209
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language: Python | 15.2k | 2.6k