llm-reasoning
There are 20 repositories under the llm-reasoning topic.
inclusionAI/AReaL
Distributed RL System for LLM Reasoning
YangLing0818/buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
yinizhilian/ICLR2025-Papers-with-Code
A collection of ICLR papers and open-source projects over the years, covering ICLR 2021, ICLR 2022, ICLR 2023, ICLR 2024, and ICLR 2025.
bruno686/Awesome-RL-based-LLM-Reasoning
Awesome RL-based LLM Reasoning
IAAR-Shanghai/Awesome-Attention-Heads
An awesome repository and a comprehensive survey on the interpretability of LLM attention heads.
inclusionAI/Ling
Ling is a MoE LLM provided and open-sourced by InclusionAI.
YangLing0818/SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
nl4opt/ORQA
[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
tsinghua-fib-lab/SmartAgent
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
CodeEval-Pro/CodeEval-Pro
Official repo for "HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Task"
bowen-upenn/llm_token_bias
[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
pittisl/PhyT2V
Official code repo for the CVPR 2025 paper "PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation"
UKPLab/emnlp2024-code-prompting
Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024
Cristian-Curaba/CryptoFormalEval
We introduce a benchmark for testing how well LLMs can find vulnerabilities in cryptographic protocols. By combining LLMs with symbolic reasoning tools like Tamarin, we aim to improve the efficiency and thoroughness of protocol analysis, paving the way for future AI-powered cybersecurity defenses.
ethicalabs-ai/ouroboros
Self-Improving LLMs Through Iterative Refinement
yahskapar/LLMs-and-Probabilistic-Reasoning
Data and software artifacts for the EMNLP 2024 (Main) paper "What Are the Odds? Language Models Are Capable of Probabilistic Reasoning"
hriaz17/SayLessRAG
Code for the paper: "Say Less, Mean More: Leveraging Pragmatics in Retrieval-Augmented Generation"
tegridydev/mixture-of-persona-research
A “Mixture of Perspectives” Framework for Ethical AI
kang-ml/chain_of_thought_with_guidance
Implement CoT using guidance-ai
ogrnv/Creating-sample-means-for-measurement-standards-of-intelligence
Creating sample means for measurement standards of intelligence