luxinyu1's Stars
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
yinzhangyue/SelfAware
Do Large Language Models Know What They Don’t Know?
FlagAI-Open/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
princeton-nlp/TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
WHUIR/Cheese-LLM
OpenBMB/CPM-Bee
百亿参数的中英文双语基座大模型
InteractiveNLP-Team/awesome-InteractiveNLP-papers
Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246) :fire:
lifan0127/ai-research-assistant
Aria is Your AI Research Assistant Powered by GPT Large Language Models
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
catherinesyeh/attention-viz
Visualizing query-key interactions in language + vision transformers
OpenLMLab/GAOKAO-Bench
GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.
hkust-nlp/ceval
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
jd/tenacity
Retrying library for Python
pengsida/learning_research
本人的科研经验
zibuyu/research_tao
NLP研究入门之道
thunlp/ToolLearningPapers
thunlp/WebCPM
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
ai-shifu/ChatALL
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers
openai/automated-interpretability
WeOpenML/PandaLM
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
Hunter-DDM/knowledge-neurons
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
EleutherAI/concept-erasure
Erasing concepts from neural representations with provable guarantees
FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
haoliuhl/chain-of-hindsight
Simple next-token-prediction for RLHF