jxzhangjhu
AI Researcher on LLM reliability, optimization, and alignment
Intuit AI ResearchMountain View
Pinned Repositories
DCR-consistency
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
sac3
Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
awesome-LLM-controlled-decoding-generation
awesome-LLM-controlled-constrained-generation
Awesome-LLM-Prompt-Optimization
Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models
Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
Awesome-OOD-detection
SOTA work about out-of-distribution detection
MatDesINNe
Inverse materials design via invertible neural networks
UQpy
UQpy (Uncertainty Quantification with python) is a general purpose Python toolbox for modeling uncertainty in physical and mathematical systems.
GGL
A pytorch implementation of the paper "Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage".
jxzhangjhu's Repositories
jxzhangjhu/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
jxzhangjhu/jxzhangjhu.github.io
jxzhangjhu/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
jxzhangjhu/AdalFlow
AdalFlow: The library to build & auto-optimize any LLM task.
jxzhangjhu/ai-algorithms
First-principle implementations of various AI algorithms using a wide range of deep learning frameworks, accompanied by relevant research papers
jxzhangjhu/awesome-o1
jxzhangjhu/bayesian-laws-icl
Bayesian scaling laws for in-context learning.
jxzhangjhu/CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
jxzhangjhu/deep-learning-pytorch-huggingface
jxzhangjhu/haloscope
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
jxzhangjhu/hyqe
jxzhangjhu/Janus
jxzhangjhu/knowledge-infused-ai
jxzhangjhu/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
jxzhangjhu/LLM-as-a-Judge
jxzhangjhu/LLMAgentPapers
Must-read Papers on LLM Agents.
jxzhangjhu/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
jxzhangjhu/modelscope-classroom
jxzhangjhu/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
jxzhangjhu/o1_Reasoning_Patterns_Study
jxzhangjhu/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
jxzhangjhu/plansearch
e
jxzhangjhu/RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
jxzhangjhu/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
jxzhangjhu/rStar
jxzhangjhu/SSO
A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimization".
jxzhangjhu/Super_MARIO
jxzhangjhu/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework Join our Community: https://discord.com/servers/agora-999382051935506503
jxzhangjhu/system-2-research
System 2 Reasoning Link Collection
jxzhangjhu/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"