ottolu

Microsoft Research AsiaBeijing, China

ottolu's Stars

deepseek-ai/DeepSeek-R1
86.6k 614 47711.2k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Language:Python44.4k 245 6.2k5.4k
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Language:Python22.9k 308 2772.1k
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Language:Python16.7k 149 1612.2k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python12.5k 82 1.6k1.7k
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python12k 88 1.5k1.3k
deepseek-ai/FlashMLA
FlashMLA: Efficient MLA decoding kernels
Language:C++11.3k 98 51793
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
Language:Jupyter Notebook7.7k 83 127496
LargeWorldModel/LWM
Large World Model -- Modeling Text and Video with Millions Context
Language:Python7.3k 66 72557
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python5.7k 34 535551
deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Language:Cuda5k 44 41502
volcengine/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Language:Python4.9k 35 265479
huggingface/deep-rl-class
This repo contains the Hugging Face Deep Reinforcement Learning Course.
Language:MDX4.2k 80 327652
MoonshotAI/Kimi-k1.5
3.2k 44 21194
hkust-nlp/simpleRL-reason
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Language:Python3.2k 33 45234
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
Language:Python2.8k 30 143225
fla-org/flash-linear-attention
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Language:Python2.1k 27 148132
btahir/open-deep-research
Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.
Language:TypeScript1.6k 12 27148
bespokelabsai/curator
Synthetic data curation for post-training and structured data extraction
Language:Python1k 5 18975
zhentingqi/rStar
Language:Python908 7 26105
LMD0311/Awesome-World-Model
Collect some World Models for Autonomous Driving (and Robotic) papers.
827 42 230
SCLBD/DeepfakeBench
A comprehensive benchmark of deepfake detection
Language:Python675 18 151101
flyingby/Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
424 17 319
SenseTime-FVG/OpenDWM
An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.
Language:Python199 8 1033
djghosh13/geneval
GenEval: An object-focused framework for evaluating text-to-image alignment
Language:HTML195 1 129
facebookresearch/DCI
Densely Captioned Images (DCI) dataset repository.
Language:Python171 4 155
rongyaofang/GoT
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
Language:Jupyter Notebook1154
princeton-nlp/CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
Language:Python99 3 1211
Mark12Ding/Dispider
[CVPR 2025]Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
Language:Python90 10 15
Alpha-Innovator/GeoX
[ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
Language:Python26 2 92

ottolu

ottolu's Stars

deepseek-ai/DeepSeek-R1

hiyouga/LLaMA-Factory

huggingface/open-r1

deepseek-ai/Janus

huggingface/trl

sgl-project/sglang

deepseek-ai/FlashMLA

NVIDIA/Cosmos

LargeWorldModel/LWM

OpenRLHF/OpenRLHF

deepseek-ai/DeepGEMM

volcengine/verl

huggingface/deep-rl-class

MoonshotAI/Kimi-k1.5

hkust-nlp/simpleRL-reason

THUDM/GLM-4-Voice

fla-org/flash-linear-attention

btahir/open-deep-research

bespokelabsai/curator

zhentingqi/rStar

LMD0311/Awesome-World-Model

SCLBD/DeepfakeBench

flyingby/Awesome-Deepfake-Generation-and-Detection

SenseTime-FVG/OpenDWM

djghosh13/geneval

facebookresearch/DCI

rongyaofang/GoT

princeton-nlp/CharXiv

Mark12Ding/Dispider

Alpha-Innovator/GeoX