jxzhangjhu

AI Researcher on LLM reliability, optimization, and alignment

Intuit AI ResearchMountain View

jxzhangjhu's Stars

dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
10.7k 895 6639
kyegomez/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.gg/jM3Z6M9uMq
Language:Python3.9k 53 294450
togethercomputer/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Language:Python2.6k 35 18363
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
2k 51 10110
maitrix-org/llm-reasoners
A library for advanced large language model reasoning
Language:Python1.6k 19 50141
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Language:Python1.4k 7 68116
deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Language:Python1.3k 23 2367
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
Language:TeX1k 24 042
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Language:Python940 13 2965
zhentingqi/rStar
Language:Python720 7 2182
open-thought/system-2-research
System 2 Reasoning Link Collection
714 23 758
ezelikman/quiet-star
Code for Quiet-STaR
Language:Python693 12 1389
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
Language:Python502 4 1638
lqtrung1998/mwp_ReFT
Language:Python424 5 852
OpenBMB/Eurus
Language:Python297 11 1114
MARIO-Math-Reasoning/Super_MARIO
Language:Python288 12 3023
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
Language:Jupyter Notebook261 1 528
ezelikman/STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
Language:Python192 4 121
kanishkg/stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
Language:Python116 1 714
McGill-NLP/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
Language:Python109 5 510
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
Language:Python97 3 110
sail-sg/CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
Language:Python85 2 43
zhiyuanhubj/UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Language:Python83 3 34
ConsequentAI/fneval
Functional Benchmarks and the Reasoning Gap
Language:TeX78 1 82
FreedomIntelligence/OVM
Language:Python61 11 84
OSU-NLP-Group/llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
Language:Python52 4 23
hbin0701/Self-Explore
[EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards
Language:Python48 1 12
psunlpgroup/ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
Language:Python27 8 03
scaleapi/plansearch
e
Language:Jupyter Notebook17 3 13
zwc662/hyqe
Language:Python51

jxzhangjhu

jxzhangjhu's Stars

dair-ai/ML-Papers-of-the-Week

kyegomez/swarms

togethercomputer/MoA

zjunlp/LLMAgentPapers

maitrix-org/llm-reasoners

openreasoner/openr

deepseek-ai/Janus

srush/awesome-o1

deepseek-ai/DeepSeek-Math

zhentingqi/rStar

open-thought/system-2-research

ezelikman/quiet-star

THUDM/ReST-MCTS

lqtrung1998/mwp_ReFT

OpenBMB/Eurus

MARIO-Math-Reasoning/Super_MARIO

YuxiXie/MCTS-DPO

ezelikman/STaR

kanishkg/stream-of-search

McGill-NLP/VinePPO

kyegomez/Lets-Verify-Step-by-Step

sail-sg/CPO

zhiyuanhubj/UoT

ConsequentAI/fneval

FreedomIntelligence/OVM

OSU-NLP-Group/llm-planning-eval

hbin0701/Self-Explore

psunlpgroup/ReaLMistake

scaleapi/plansearch

zwc662/hyqe