Pinned Repositories
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
AR_tju
北洋AR,建立在天津大学录取通知书上,然后可以将学校的图像显示出来
awesome-reinforcement-learning-lib
GitHub's code repository is all you need
awesome-reinforcement-learning-zh
中文整理的强化学习资料(Reinforcement Learning)
deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
DRL_trick
maddpg-tf
use tensorflow to implement the MADDPG(simple_tag)
MARL-101
just for fun
sc2-101-zh
just for fun
wwxFromTju's Repositories
wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
wwxFromTju/MA-RLlib
wwxFromTju/tju_rl_platform
wwxFromTju/ASN_cloud
wwxFromTju/hok_env
wwxFromTju/wwxFromTju.github.io
wwxFromTju/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
wwxFromTju/aim
Aim — an easy-to-use and performant open-source experiment tracker.
wwxFromTju/alphastar
wwxFromTju/dpo-rlaif
wwxFromTju/DyAN_backbone
wwxFromTju/evogym
A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.
wwxFromTju/evosax
Evolution Strategies in JAX 🦎
wwxFromTju/ha_ma_ppo
wwxFromTju/huggingface_rllib
Load and upload RLlib models from and to the Hub.
wwxFromTju/HumanoidAgents
Humanoid Agents: Platform for Simulating Human-like Generative Agents
wwxFromTju/HumanSystemOptimization
健康学习到150岁 - 人体系统调优不完全指南
wwxFromTju/JaxMARL
Multi-Agent Reinforcement Learning with JAX
wwxFromTju/MAIC
The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".
wwxFromTju/MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
wwxFromTju/muzero-cpp
A C++ pytorch implementation of MuZero
wwxFromTju/purejaxrl
Really Fast End-to-End Jax RL Implementations
wwxFromTju/rainbow_extend
wwxFromTju/README
README文件语法解读,即Github Flavored Markdown语法介绍
wwxFromTju/smac_full_action_space
wwxFromTju/sotopia
wwxFromTju/summarize_from_feedback_details
wwxFromTju/wilderness-scavenger
A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.
wwxFromTju/XAgent
An Autonomous LLM Agent for Complex Task Solving