zchuz's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
QwenLM/Qwen2.5-Coder
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
laekov/fastmoe
A fast MoE impl for PyTorch
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
princeton-nlp/SimPO
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
DmitryRyumin/AAAI-2024-Papers
AAAI 2024 Papers: Explore a comprehensive collection of innovative research papers presented at one of the premier artificial intelligence conferences. Seamlessly integrate code implementations for better understanding. ⭐ experience the forefront of progress in artificial intelligence with this repository!
hao-ai-lab/Consistency_LLM
[ICML 2024] CLLMs: Consistency Large Language Models
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
zepingyu0512/awesome-llm-understanding-mechanism
awesome papers in LLM interpretability
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey
dvlab-research/Step-DPO
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
cooperleong00/Awesome-LLM-Interpretability
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
HITsz-TMG/awesome-llm-attributions
A Survey of Attributions for Large Language Models
lancopku/label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
wjn1996/Awesome-LLM-Reasoning-Openai-o1-Survey
The related works and background techniques about Openai o1
sail-sg/sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
kyegomez/Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
LightChen233/M3CoT
joykirat18/How-To-Think-Step-by-Step
How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning
LaVi-Lab/LongContextReasoner
[ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners