Yangyang0906C

Yangyang0906C's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python170k 1.5k 3.1k44.8k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python37.6k 220 5.7k4.6k
windingwind/zotero-pdf-translate
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.
Language:TypeScript7.9k 23 848367
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.9k 111 137421
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML4.6k 23 8533
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.7k 28 381349
google-research/football
Check out the new game server:
Language:Python3.4k 94 3211.3k
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1.9k 30 131391
multimodal-art-projection/MAP-NEO
Language:Python900 11 3484
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Language:Python480 20 2851
shariqiqbal2810/REFIL
Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning" ICML 2021
Language:Python64 2 313
JiwenJ/Awesome-RL
A curated list of RL resources
33 3 06
yinyueqin/relative-preference-optimization
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
Language:Python20 2 01