reasoning-language-models
There are 64 repositories under reasoning-language-models topic.
zai-org/GLM-4.5
GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
mims-harvard/TxAgent
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
dvlab-research/Seg-Zero
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
LightChen233/Awesome-Long-Chain-of-Thought-Reasoning
Latest Advances on Long Chain-of-Thought Reasoning
DavidZWZ/Awesome-RAG-Reasoning
[EMNLP 2025] Awesome RAG Reasoning Resources
dvlab-research/VisionReasoner
Vision Manus: Your versatile Visual AI assistant
krystalan/DRT
Deep Reasoning Translation (DRT) Project
mims-harvard/ToolUniverse
ToolUniverse is a collection of biomedical tools designed for AI agents
multimodal-art-projection/LatentCoT-Horizon
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
a-m-team/a-m-models
a-m-team's exploration in large language modeling
codelion/pts
Pivotal Token Search
yihedeng9/OpenVLThinker
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
SalesforceAIResearch/MAS-Zero
Designing Multi-Agent Systems with Zero Supervision
mims-harvard/CUREBench
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
spcl/x1
Official Implementation of "Reasoning Language Models: A Blueprint"
Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
The-FinAI/Fino1
This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.
DolbyUUU/Logic-RL-Lite
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
MozerWang/AMPO
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
AI4Phys/SeePhys
Official implementation for the paper "SeePhys: Does Seeing Help Thinking? -- Benchmarking Vision-Based Physics Reasoning"
MaxBelitsky/cache-steering
KV Cache Steering for Inducing Reasoning in Small Language Models
WisdomShell/RewardAnything
RewardAnything: Generalizable Principle-Following Reward Models
DolbyUUU/DeepEnlighten
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
zihao-ai/unthinking_vulnerability
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
tum-ai/number-token-loss
A regression-alike loss to improve numerical reasoning in language models - ICML 2025
zonenoname/CharmBench
A preview-version of one novel multimodal reasoning benchmark CharmBench.
linhaowei1/kumo
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
Hyun-Ryu/clover
Official code for "Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning", ICLR 2025.
thinkwee/NOVER
[EMNLP-2025] R1-Zero on ANY TASK
tomascupr/thinkthread
thinkthread SDK - Supercharge Your AI Applications with Human-Like Reasoning
Trustworthy-ML-Lab/ThinkEdit
An effective weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study uncovering how reasoning length is encoded in the model’s representation space.
sparkle-reasoning/sparkle
Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
parameterlab/leaky_thoughts
Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers"
NellyW8/VeriReason
This is the Github Repo for the paper: VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation
AmanPriyanshu/GPT-OSS-MoE-ExpertFingerprinting
ExpertFingerprinting: Behavioral Pattern Analysis and Specialization Mapping of Experts in GPT-OSS-20B's Mixture-of-Experts Architecture