r1

There are 51 repositories under the r1 topic.

zzli2022/Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
Language:Python1.2k 11 869
turningpoint-ai/VisualThinker-R1-Zero
Explore the Multimodal “Aha Moment” on 2B Model
Language:Python608 15 1022
jingyi0000/R1-VL
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
Language:Python4230
modelscope/awesome-deep-reasoning
Collect every awesome work about r1!
Language:Python416 6 015
XiaoYee/Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
2934
DMontgomery40/deepseek-mcp-server
Model Context Protocol server for DeepSeek's advanced language models
Language:JavaScript271 1 217
RyanLiu112/compute-optimal-tts
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
Language:Python271 8 1421
SmallDoges/small-doge
Doge Family of Small Language Models
Language:Python173 3 613
sun-hailong/TVC
[ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" in PyTorch.
Language:Python143 1 11
CJReinforce/PURE
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
Language:Python136 2 23
RyanLiu112/Awesome-Process-Reward-Models
A comprehensive collection of process reward models.
108 1 01
RyanLiu112/GenPRM
Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
Language:Python810
HJYao00/Awesome-Reasoning-MLLM
Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAI o1, and DeepSeek-R1
57 1 02
The-Martyr/Awesome-Multimodal-Reasoning
Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models
360
LazaUK/AIFoundry-DeepSeek-SDK
Notebooks to demo the use of Azure AI Python SDK / LangChain with DeepSeek R1 reasoning model in Azure AI Foundry.
Language:Jupyter Notebook31 2 06
glide-the/InterpretationoDreams
Agent tasks designed with LangChain, including planning conversation-scene resources, building subtasks, and a task executor that includes MCTS
Language:Jupyter Notebook29 1 02
sylvain-wei/24-Game-Reasoning
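A very simple reproduction of DeepSeek-R1-Zero and DeepSeek-R1, using the 24 Game as an example. It uses zero-RL, SFT, and SFT+RL to elicit the LLM's capacity for autonomous verification and reflection. Clean, minimal, accessible reproduction of DeepSeek R1-Zero, DeepSeek R1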
Language:Python26 1 22
lachlancresswell/AutoR1
Auto-generate fallback and meter display from existing group info in d&b audiotechnik's R1 and ArrayCalc software.
Language:Python21 7 72
sdiehl/tiny-r1
Recreating the minimal training methods of DeepSeek-R1 for small language models.
Language:Python21 1 03
The-Swarm-Corporation/AgentGym
A framework that makes it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's r1
Language:Python20 1 0
BY571/DistRL-LLM
Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
Language:Python19 1 01
IoTDevice/phicomm-r1-controler
Control program for the Phicomm R1 speaker
Language:Go19 1 02
tyler-romero/microR1
Simple repository for training small reasoning models
Language:Python121
ericsson-iap/python-sample-app
Python Sample App for SMO Systems like Ericsson Intelligent Automation Platform. We aim to be ORAN aligned. Use this to kickstart your own app!
Language:Python11 5 10
nschlaepfer/ChainForge-R1-SuperCoT
A multi-stage pipeline that enhances Qwen2.5 language models with DeepSeek Reasoner's chain-of-thought capabilities. Implements the DeepSeek-R1 methodology through cold-start SFT, reasoning-oriented RL, rejection sampling, and optional model distillation.
Language:Python10 2 03
lechmazur/goods
LLM public goods game
8 1 00
OnerootProject/r1
R1 Protocol
Language:JavaScript7 5 05
Xuchen-Li/OvO-R1
Exploring the influence of using end-to-end reinforcement learning and various reward functions on the reasoning capabilities of different 1.5B base models.
Language:Python50
ericsson-iap/go-sample-app
Go Sample App for SMO Systems like Ericsson Intelligent Automation Platform. We aim to be ORAN aligned. Use this to kickstart your own app!
Language:Go4 3 01
PINT-NMR/PINT
NMR spectroscopy software for line shape fitting and downstream analysis
Language:HTML4 1 01
Trae1ounG/Chinese-Logic-RL
Exploring R1 on Logic Puzzle in Chinese
Language:Python3
Berstarhunter/deepseek-start
deepseek-start is a powerful tool designed for deep searching and analysis of large datasets, allowing users to efficiently navigate through complex data structures with ease. With its intuitive interface and advanced algorithms, deepseek-start provides researchers and analysts with the means to uncover valuable insights and patterns hidden within
2 1 00
Kuberwastaken/free-deep-research
My free implementation of @dzhng's implementation of OpenAI's new Deep Research agent. Get (almost) the same capability for free. You can even tweak the behavior of the agent with adjustable breadth and depth. Run it for 5 min or 5 hours, it'll auto adjust :)
Language:TypeScript2 1 00
Kuberwastaken/TREAT-R1
A DeepSeek R1 version of TREAT: An Open-Source AI Web App to Detect Triggering Content in Movies and Shows
Language:Python2 1 01
NEBYTE/deepseek-rs
DeepSeek-RS is a personal project implementing DeepSeek's architecture in Rust for learning and experimentation. This is not an official DeepSeek project.
Language:Rust20
SYSTEMS-OPERATOR/SUPER-POLE-POSITION
HYPERPOLE GYM
Language:Python1 1 00
