Pinned Repositories
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
ChatPaper
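Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow with ChatGPT: full-paper summarization, professional translation, polishing, peer review, and review responses.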
DRLib
DRLib: a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost All Off-Policy RL Algorithms.
MOSS-RLHF
MOSS-RLHF
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Anima
The first open-source 33B Chinese LLM based on QLoRA
auto-survey
Use GPT to summarize papers related to a given title
ChatPaper
Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow with ChatGPT: paper summarization, polishing, peer review, and review responses.
DRLib
My DRL library for TensorFlow 1.14 and PyTorch, with HER and PER added; core code based on https://github.com/openai/spinningup
red-tie's Repositories
red-tie/auto-survey
Use GPT to summarize papers related to a given title
red-tie/Anima
The first open-source 33B Chinese LLM based on QLoRA
red-tie/ChatPaper
Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow with ChatGPT: paper summarization, polishing, peer review, and review responses.
red-tie/DRLib
My DRL library for TensorFlow 1.14 and PyTorch, with HER and PER added; core code based on https://github.com/openai/spinningup