ottolu's Stars
deepseek-ai/DeepSeek-R1
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
huggingface/trl
Train transformer language models with reinforcement learning.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
deepseek-ai/FlashMLA
FlashMLA: Efficient MLA decoding kernels
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers, and a video processing pipeline to accelerate the development of Physical AI at robotics and AV labs. Cosmos is purpose-built for physical AI. The Cosmos repository enables end users to run the Cosmos models, run inference scripts, and generate videos.
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
huggingface/deep-rl-class
This repo contains the Hugging Face Deep Reinforcement Learning Course.
MoonshotAI/Kimi-k1.5
hkust-nlp/simpleRL-reason
This is a replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
THUDM/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
fla-org/flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
btahir/open-deep-research
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
bespokelabsai/curator
Synthetic data curation for post-training and structured data extraction
zhentingqi/rStar
LMD0311/Awesome-World-Model
A collection of world model papers for autonomous driving (and robotics).
SCLBD/DeepfakeBench
A comprehensive benchmark of deepfake detection
flyingby/Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
SenseTime-FVG/OpenDWM
An open source code repository of driving world models, with training, inference, and evaluation tools, and pretrained checkpoints.
djghosh13/geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
facebookresearch/DCI
Densely Captioned Images (DCI) dataset repository.
rongyaofang/GoT
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
princeton-nlp/CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Mark12Ding/Dispider
[CVPR 2025] Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
Alpha-Innovator/GeoX
[ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training