ray075hl

729872400@qq.com

NWPU, Beijing

ray075hl's Stars

QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Language: Python 5.2k 43 407445
allenai/open-instruct
Language: Python 2.3k 22 153263
codelion/optillm
Optimizing inference proxy for LLMs
Language: Python 1.9k 23 51151
AIDC-AI/Marco-o1
An Open Large Reasoning Model for Real-World Solutions
Language: Python 1.4k 19 2268
trotsky1997/MathBlackBox
Language: Python 990 26 10101
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Language: Python 970 19 965
NousResearch/Hermes-Function-Calling
Language: Jupyter Notebook 773 14 2698
SimpleBerry/LLaMA-O1
Large Reasoning Models
Language: Python 770 19 2543
lapisrocks/LanguageAgentTreeSearch
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Language: Python 719 10 3073
zhentingqi/rStar
Language: Python 709 7 2179
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
647 17 330
win4r/o1
Using Groq or OpenAI or Ollama to create o1-like reasoning chains
Language: Python 290 6 444
collin-burns/discovering_latent_knowledge
Language: Python 257 6 236
1989Ryan/llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
Language: Python 237 5 420
FlagOpen/FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
Language: Python 205 8 1752
EleutherAI/elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
Language: Python 192 6 9033
mattneary/attention
visualizing attention for LLM users
Language: Python 184 2 18
FreedomIntelligence/Chain-of-Diagnosis
An interpretable large language model (LLM) for medical diagnosis.
Language: Python 107 12 11
BrendanGraham14/mcts-llm
Language: Python 100 1 319
mitmedialab/MDAgents
Official implementation for NeurIPS'24 paper: MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
Language: Python 97 3 413
mbchang/meta-prompt
A re-implementation of Meta-Prompt in LangChain for building self-improving agents.
Language: Jupyter Notebook 62 2 03
benlipkin/decoding
Composable inference algorithms with LLMs and programmable logic
Language: Python 55 1 01
balevinstein/Probes
Language: Python 44 1 211
jys5609/MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
Language: Python 32 2 016
DIRECT-BIT/SRA-MCTS
Language: Python 25 1 23
SIMONLQY/RethinkMCTS
Language: Python 211
GiovanniGatti/socratic-llm
Training pipeline for fine tuning Phi-3-mini-instruct to follow the Socratic method
Language: Python 16 4 11
hanqi-qi/Mirror
Language: Python 13 1 10
amazon-science/factual-confidence-of-llms
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
Language: Python 11 0 01
MrBlankness/TPO
Pytorch implementation of Tree Preference Optimization (TPO)
Language: Python 10 1 10
