wwn1233's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
datawhalechina/self-llm
《开源大模型食用指南》针对**宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
huggingface/trl
Train transformer language models with reinforcement learning.
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
andrewyng/translation-agent
Deep-Agent/R1-V
Witness the aha moment of VLM with less than $3.
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
EvolvingLMMs-Lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
zzli2022/Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
lmarena/arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
NVlabs/DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
project-numina/aimo-progress-prize
IceBearAI/LLM-And-More
LLM-And-More is a professional, plug-and-play, llm trainer and application builder that guides you through the complete LLM workflow from data to evaluation, from training to deployment, from idea to sevice. / LLM-And-More 是一个专业、开箱即用的大模型训练及应用构建一站式解决方案,包含从数据到评估、从训练到部署、从想法到服务的全流程最佳实践。
OpenBMB/Eurus
mazzzystar/TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
cnzzx/VSA
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
MediaBrain-SJTU/GenMedicalEval
mtbench101/mt-bench-101
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
HC-Guo/Awesome-Multimodal-Chain-of-Thought
Collection of papers and repos for multimodal chain-of-thought
dvlab-research/MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
ctlllll/understanding_llm_benchmarks
Understanding the correlation between different LLM benchmarks
JourneyBench/JourneyBench
wwn1233/sedareval
SedarEval: Automated Evaluation using Self-Adaptive Rubrics