1349949's Stars
yt-dlp/yt-dlp
A feature-rich command-line audio/video downloader
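A minimal usage sketch of yt-dlp's Python API (assuming `pip install yt-dlp`; the URL and options below are illustrative placeholders, not taken from this list):

```python
# Minimal sketch: download a video's best audio stream via yt-dlp's Python API.
# Assumes `pip install yt-dlp`; the URL below is a placeholder.
from yt_dlp import YoutubeDL

opts = {
    "format": "bestaudio/best",      # prefer the best audio-only stream
    "outtmpl": "%(title)s.%(ext)s",  # name the output file after the video title
}
with YoutubeDL(opts) as ydl:
    ydl.download(["https://example.com/watch?v=placeholder"])
```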
ggerganov/llama.cpp
LLM inference in C/C++
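A minimal inference sketch, going through the third-party llama-cpp-python bindings rather than the C/C++ API directly (the GGUF model path is a placeholder assumption):

```python
# Minimal sketch: text completion via llama-cpp-python (`pip install llama-cpp-python`),
# which wraps llama.cpp. The model path is a placeholder for any local GGUF checkpoint.
from llama_cpp import Llama

llm = Llama(model_path="./models/example.Q4_K_M.gguf", n_ctx=2048)
out = llm("Q: Name the planets in the solar system. A:", max_tokens=64, stop=["Q:"])
print(out["choices"][0]["text"])
```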
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions of Tokens of Context
PKU-YuanGroup/ChatLaw
ChatLaw: a powerful LLM tailored for the Chinese legal domain (Chinese legal large language model)
THUDM/CogVLM
A state-of-the-art open visual language model (multimodal pre-trained model)
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
modelscope/ms-swift
Use PEFT or full-parameter training to fine-tune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
SCIR-HI/Huatuo-Llama-Med-Chinese
Repo for BenTsao (original name: HuaTuo, 华驼): instruction-tuning large language models with Chinese medical knowledge.
open-compass/opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) across 100+ datasets.
shibing624/MedicalGPT
MedicalGPT: Training your own medical GPT model with a ChatGPT-style training pipeline; implements incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
ytongbai/LVM
shikras/shikra
mlfoundations/datacomp
DataComp: In search of the next generation of multimodal datasets
BradyFU/Woodpecker
✨✨ Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
lucidrains/magvit2-pytorch
Implementation of the MagViT2 tokenizer in PyTorch
baaivision/Uni3D
[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI
magic-research/bubogpt
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs
OpenGVLab/all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition & Understanding and General Relation Comprehension of the Open World
TonyLianLong/LLM-groundedDiffusion
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LMD, TMLR 2024)
OpenDriveLab/ST-P3
[ECCV 2022] ST-P3: an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.
LinShan-Bin/OccNeRF
Code for "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".
tsb0601/MMVP
bytedance/lynx-llm
Paper: https://arxiv.org/abs/2307.02469 | Project page: https://lynx-llm.github.io/
FreedomIntelligence/Huatuo-26M
The largest-scale Chinese medical QA dataset, with 26,000,000 question-answer pairs.
E2E-AD/AD-MLP
wudongming97/Prompt4Driving
[AAAI 2025] Language Prompt for Autonomous Driving
mynameischaos/Lion
Lion: Kindling Vision Intelligence within Large Language Models
will-singularity/Skywork-MM
Empirical Study Towards Building An Effective Multi-Modal Large Language Model