jxzhangjhu
AI Researcher on LLM reliability, optimization, and alignment
Intuit AI Research, Mountain View
jxzhangjhu's Stars
dair-ai/ML-Papers-of-the-Week
🔥 Highlighting the top ML papers every week.
kyegomez/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Join our Community: https://discord.gg/jM3Z6M9uMq
togethercomputer/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
zhentingqi/rStar
open-thought/system-2-research
System 2 Reasoning Link Collection
ezelikman/quiet-star
Code for Quiet-STaR
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
lqtrung1998/mwp_ReFT
OpenBMB/Eurus
MARIO-Math-Reasoning/Super_MARIO
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
ezelikman/STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
kanishkg/stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
McGill-NLP/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
sail-sg/CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
zhiyuanhubj/UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
ConsequentAI/fneval
Functional Benchmarks and the Reasoning Gap
FreedomIntelligence/OVM
OSU-NLP-Group/llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
hbin0701/Self-Explore
[EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards
psunlpgroup/ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
scaleapi/plansearch
zwc662/hyqe