wwn1233

LLM/MLLM/Video-LLM

wwn1233's Stars

hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language: Python 44.7k 246 6.2k 5.5k
datawhalechina/self-llm
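"A Hands-On Guide to Open-Source Large Models": a tutorial tailored for ** beginners on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large models (MLLMs), from China and abroad, in a Linux environment.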
Language: Jupyter Notebook 13.8k 90 2211.6k
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python 12.6k 85 1.6k 1.7k
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.
Language: Python 7.3k 58 836562
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
Language: Python 6.5k 51 1.7k 794
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
Language: Python 6.4k 33 1.9k 548
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language: Python 5.7k 34 536557
andrewyng/translation-agent
Language: Python 5.3k 56 19635
Deep-Agent/R1-V
Witness the aha moment of VLM with less than $3.
Language: Python 3.3k 46 136261
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Language: Python 2.9k 249 47374
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Language: Python 1.7k 10 71133
zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
Language: Python 1.5k 10 576
EvolvingLMMs-Lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
Language: Python 1.1k 12 2357
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Language: Python 847 8 8159
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
Language: Python 816 7 2449
zzli2022/Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
Language: Python 794 12 827
lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
Language: Python 759 8 3993
NVlabs/DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
488 55 518
project-numina/aimo-progress-prize
Language: Jupyter Notebook 413 7 1032
IceBearAI/LLM-And-More
LLM-And-More is a professional, plug-and-play LLM trainer and application builder that guides you through the complete LLM workflow: from data to evaluation, from training to deployment, and from idea to service.
Language: Go 380 25 859
OpenBMB/Eurus
Language: Python 311 11 1114
mazzzystar/TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
Language: Jupyter Notebook 143 4 49
cnzzx/VSA
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Language: Python 117 3 49
MediaBrain-SJTU/GenMedicalEval
80 0 47
mtbench101/mt-bench-101
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
77 6 1139
HC-Guo/Awesome-Multimodal-Chain-of-Thought
Collection of papers and repos for multimodal chain-of-thought
67 2 03
dvlab-research/MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
Language: Python 45 2 41
ctlllll/understanding_llm_benchmarks
Understanding the correlation between different LLM benchmarks
Language: Jupyter Notebook 29 2 00
JourneyBench/JourneyBench
Language: Python 2 1 01
wwn1233/sedareval
SedarEval: Automated Evaluation using Self-Adaptive Rubrics
Language: Python 2 1 00
