ray075hl's Stars
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
allenai/open-instruct
codelion/optillm
Optimizing inference proxy for LLMs
AIDC-AI/Marco-o1
An Open Large Reasoning Model for Real-World Solutions
trotsky1997/MathBlackBox
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
NousResearch/Hermes-Function-Calling
SimpleBerry/LLaMA-O1
Large Reasoning Models
lapisrocks/LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
zhentingqi/rStar
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
win4r/o1
Using Groq or OpenAI or Ollama to create o1-like reasoning chains
collin-burns/discovering_latent_knowledge
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
FlagOpen/FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
EleutherAI/elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
mattneary/attention
visualizing attention for LLM users
FreedomIntelligence/Chain-of-Diagnosis
An interpretable large language model (LLM) for medical diagnosis.
BrendanGraham14/mcts-llm
mitmedialab/MDAgents
Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
mbchang/meta-prompt
A re-implementation of Meta-Prompt in LangChain for building self-improving agents.
benlipkin/decoding
Composable inference algorithms with LLMs and programmable logic
balevinstein/Probes
jys5609/MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
DIRECT-BIT/SRA-MCTS
SIMONLQY/RethinkMCTS
GiovanniGatti/socratic-llm
Training pipeline for fine tuning Phi-3-mini-instruct to follow the Socratic method
hanqi-qi/Mirror
amazon-science/factual-confidence-of-llms
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
MrBlankness/TPO
Pytorch implementation of Tree Preference Optimization (TPO)