peiyingxin's Stars
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
openreasoner/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
Open-Source-O1/Open-O1
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
microsoft/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
VITA-Group/LiGO
[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Cox, Zhangyang Wang, Yoon Kim
xinychen/tensor-book
Tensor Computations Tutorials
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
FranxYao/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
HIT-SCIR/Chinese-Mixtral-8x7B
Chinese-Mixtral-8x7B
intelligent-machine-learning/dlrover
DLRover: An Automatic Distributed Deep Learning System
datawhalechina/pumpkin-book
Detailed formula derivations for the book "Machine Learning" (the "Watermelon Book")
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
DSXiangLi/DecryptPrompt
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
ossu/computer-science
🎓 Path to a free self-taught education in Computer Science!
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
binary-husky/gpt_academic
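Provides a practical interactive interface for GPT/GLM and other LLMs, specially optimized for paper reading, polishing, and writing. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-translation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.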
HqWu-HITCS/Awesome-Chinese-LLM
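A curated collection of open-source Chinese large language models, focusing on smaller-scale models that can be privately deployed and are inexpensive to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.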
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
wshzd/Awesome-AIGC
A collection of AIGC learning resources, continuously updated.
ymcui/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs (phase II project) with 64K long context models
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Adlik/model_optimizer
FlagAI-Open/FlagAI
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries