Pinned Repositories
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
easy-rl
强化学习中文教程(蘑菇书),在线阅读地址:https://datawhalechina.github.io/easy-rl/
EdgeFed-MARL-MEC
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
fair_flearn
Fair Resource Allocation in Federated Learning (ICLR '20)
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Mava
A library of multi-agent reinforcement learning components and systems
Multi-Agent-Deep-Deterministic-Policy-Gradients
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
ffxu1024's Repositories
ffxu1024/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥
ffxu1024/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
ffxu1024/CS-Books
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
ffxu1024/easy-rl
强化学习中文教程(蘑菇书),在线阅读地址:https://datawhalechina.github.io/easy-rl/
ffxu1024/EdgeFed-MARL-MEC
ffxu1024/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
ffxu1024/fair_flearn
Fair Resource Allocation in Federated Learning (ICLR '20)
ffxu1024/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
ffxu1024/Mava
A library of multi-agent reinforcement learning components and systems
ffxu1024/Multi-Agent-Deep-Deterministic-Policy-Gradients
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
ffxu1024/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
ffxu1024/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
ffxu1024/Practicing-Federated-Learning
ffxu1024/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
ffxu1024/pytorch-A3C
Simple A3C implementation with pytorch + multiprocessing
ffxu1024/Reinforce
Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments
ffxu1024/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
ffxu1024/TinyWebServer
:fire: Linux下C++轻量级Web服务器
ffxu1024/UAV