wwxFromTju

Make MAS(DRL) Great Again ! 🐶

DRL/MASTianjin China

Pinned Repositories

OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python6.1k 35 583600
MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python1k 10 156169
AR_tju
北洋AR，建立在天津大学录取通知书上，然后可以将学校的图像显示出来
Language:Objective-C3 1 02
awesome-reinforcement-learning-lib
GitHub's code repository is all you need
347 3 142
awesome-reinforcement-learning-zh
中文整理的强化学习资料（Reinforcement Learning）
2k 79 2361
deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python129 7 125
DRL_trick
33 4 18
maddpg-tf
use tensorflow to implement the MADDPG(simple_tag)
Language:Python18 2 25
MARL-101
just for fun
13 1 03
sc2-101-zh
just for fun
Language:Python23 5 26

wwxFromTju's Repositories

wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
347 3 142
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python129 7 125
wwxFromTju/MA-RLlib
Language:Python2 3 0
wwxFromTju/tju_rl_platform
Language:Python2 5 0
wwxFromTju/ASN_cloud
Language:Python1 2 0
wwxFromTju/hok_env
1 0 0
wwxFromTju/wwxFromTju.github.io
Language:HTML1 1 0
wwxFromTju/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Language:Python0 0
wwxFromTju/aim
Aim — an easy-to-use and performant open-source experiment tracker.
Language:TypeScript0 0
wwxFromTju/alphastar
Language:Python0 0
wwxFromTju/dpo-rlaif
wwxFromTju/DyAN_backbone
Language:Python1 1
wwxFromTju/evogym
A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.
Language:Python0 0
wwxFromTju/evosax
Evolution Strategies in JAX 🦎
Language:Python0 0
wwxFromTju/ha_ma_ppo
Language:Python0 0
wwxFromTju/huggingface_rllib
Load and upload RLlib models from and to the Hub.
Language:Jupyter Notebook0 0
wwxFromTju/HumanoidAgents
Humanoid Agents: Platform for Simulating Human-like Generative Agents
Language:Python0 0
wwxFromTju/HumanSystemOptimization
健康学习到150岁 - 人体系统调优不完全指南
0 0
wwxFromTju/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language:Python0 0
wwxFromTju/MAIC
The implementation of AAAI'22 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".
Language:Python0 0
wwxFromTju/MARS
MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.
Language:Jupyter Notebook0 0
wwxFromTju/muzero-cpp
A C++ pytorch implementation of MuZero
Language:C++0 0
wwxFromTju/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python0 0
wwxFromTju/rainbow_extend
Language:Python0 0
wwxFromTju/README
README文件语法解读，即Github Flavored Markdown语法介绍
0 0
wwxFromTju/smac_full_action_space
Language:Python2 0
wwxFromTju/sotopia
Language:Python0 0
wwxFromTju/summarize_from_feedback_details
Language:Python0 0
wwxFromTju/wilderness-scavenger
A platform for intelligent agent learning based on a 3D open-world FPS game developed by Inspir.AI.
Language:ASP.NET0 0
wwxFromTju/XAgent
An Autonomous LLM Agent for Complex Task Solving
Language:TypeScript0 0