luxinyu1's Stars
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
google-deepmind/tracr
protagolabs/odyssey-math
ucl-dark/llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
abdulhaim/LMRL-Gym
Linear95/SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
teacherpeterpan/self-correction-llm-papers
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
zchuz/CoT-Reasoning-Survey
[ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
stanford-crfm/ecosystem-graphs
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Edward-Sun/easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Edward-Sun/gpt-accelera
Simple and efficient PyTorch-native transformer training and inference (batched)
THUDM/AlignBench
A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets.
xai-org/grok-1
Grok open release
princeton-nlp/SWE-bench
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
openai/transformer-debugger
ArthurConmy/Automatic-Circuit-Discovery
openai/democratic-inputs
arcee-ai/mergekit
Tools for merging pretrained large language models.
conversationai/perspectiveapi
Perspective is an API that uses machine learning models to score the perceived impact a comment might have on a conversation. See https://developers.perspectiveapi.com for more information.
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
openai/weak-to-strong
jxzhangjhu/Awesome-LLM-Uncertainty-Reliability-Robustness
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
kagisearch/pyllms
Minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, Groq, Reka, Together, AI21, Cohere, Aleph Alpha, HuggingfaceHub), with a built-in model performance benchmark.
songquanpeng/one-api
OpenAI key management & redistribution system, using a single API for all LLMs. Supports Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. Can be used to manage and redistribute keys; ships as a single executable with a prebuilt Docker image for one-click deployment, and features an English UI.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences