jxzhangjhu

AI Researcher on LLM reliability, optimization, and alignment

Intuit AI Research, Mountain View

Pinned Repositories

DCR-consistency
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
Language: Python 22 4 03
sac3
Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
Language: Jupyter Notebook 33 11 17
awesome-LLM-controlled-decoding-generation
awesome-LLM-controlled-constrained-generation
31 1 03
Awesome-LLM-Prompt-Optimization
Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models
254 8 19
Awesome-LLM-RAG
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
1k 10 063
Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
683 25 247
Awesome-OOD-detection
SOTA work about out-of-distribution detection
13 2 11
MatDesINNe
Inverse materials design via invertible neural networks
Language: Jupyter Notebook 61 3 213
UQpy
UQpy (Uncertainty Quantification with python) is a general purpose Python toolbox for modeling uncertainty in physical and mathematical systems.
Language: Python 281 17 12981
GGL
A PyTorch implementation of the paper "Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage".
Language: Jupyter Notebook 58 4 015

jxzhangjhu's Repositories

jxzhangjhu/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
1 0 0
jxzhangjhu/jxzhangjhu.github.io
Language: HTML 1 1 0
jxzhangjhu/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
1
jxzhangjhu/AdalFlow
AdalFlow: The library to build & auto-optimize any LLM task.
Language: Python 0 0
jxzhangjhu/ai-algorithms
First-principle implementations of various AI algorithms using a wide range of deep learning frameworks, accompanied by relevant research papers
Language: Jupyter Notebook 0 0
jxzhangjhu/awesome-o1
jxzhangjhu/bayesian-laws-icl
Bayesian scaling laws for in-context learning.
jxzhangjhu/CPO
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
jxzhangjhu/deep-learning-pytorch-huggingface
jxzhangjhu/haloscope
Source code for the NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
Language: Python 0 0
jxzhangjhu/hyqe
jxzhangjhu/Janus
jxzhangjhu/knowledge-infused-ai
jxzhangjhu/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
jxzhangjhu/LLM-as-a-Judge
jxzhangjhu/LLMAgentPapers
Must-read Papers on LLM Agents.
jxzhangjhu/MoA
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
jxzhangjhu/modelscope-classroom
jxzhangjhu/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
0 0
jxzhangjhu/o1_Reasoning_Patterns_Study
jxzhangjhu/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
jxzhangjhu/plansearch
jxzhangjhu/RefChecker
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
jxzhangjhu/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
jxzhangjhu/rStar
jxzhangjhu/SSO
A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimization".
jxzhangjhu/Super_MARIO
Language: Python 0 0
jxzhangjhu/swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Join our community: https://discord.com/servers/agora-999382051935506503
Language: Python 0 0
jxzhangjhu/system-2-research
System 2 Reasoning Link Collection
jxzhangjhu/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
Language: Python 0 0