Pinned Repositories
aamas_19
Source code for the paper "Online Abstraction with MDP Homomorphisms for Deep Learning".
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
ai-deadlines
:alarm_clock: AI conference deadline countdowns
AlphaGOZero-python-tensorflow
Congratulations to DeepMind! This is a re-engineered implementation (building on many other git repos in /support/) of DeepMind's Oct 19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository is for educational purposes only.)
anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Arcade-Learning-Environment
The Arcade Learning Environment (ALE) -- a platform for AI research.
ArraySortAlgorithm
Various sorting algorithms
GHQ
Official simplified implementation for "GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning"
Synthetic-PandoraHearts-Jack
TStarBots
Lamperougeyxy's Repositories
Lamperougeyxy/GHQ
Official simplified implementation for "GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning"
Lamperougeyxy/Synthetic-PandoraHearts-Jack
Lamperougeyxy/ai-deadlines
:alarm_clock: AI conference deadline countdowns
Lamperougeyxy/anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Lamperougeyxy/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
Lamperougeyxy/CityFlow
A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
Lamperougeyxy/DOP
Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
Lamperougeyxy/facmac
Lamperougeyxy/FinRL
FinRL: Financial Reinforcement Learning. 🔥
Lamperougeyxy/gem
Lamperougeyxy/go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
Lamperougeyxy/gps
Guided Policy Search
Lamperougeyxy/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Lamperougeyxy/Homophily-MARL
Code for "Learning Homophilic Incentives in Sequential Social Dilemmas"
Lamperougeyxy/LESSON
Lamperougeyxy/MADDPG-1
PyTorch implementation of the MARL algorithm MADDPG, corresponding to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
Lamperougeyxy/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Lamperougeyxy/MPE-Multiagent-RL-Algos
Simple verification experiment code for multi-agent RL using the OpenAI MPE environment
Lamperougeyxy/multi-agent-PPO-on-SMAC
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
Lamperougeyxy/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Lamperougeyxy/Offline-Pre-trained-Multi-Agent-Decision-Transformer
Lamperougeyxy/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Lamperougeyxy/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Lamperougeyxy/QPLEX
Lamperougeyxy/smac_plus
An open source benchmark for Multi Agent Reinforcement Learning
Lamperougeyxy/smacv2
Lamperougeyxy/Synthetic-PandoraHearts
Lamperougeyxy/TRPO-in-MARL
Lamperougeyxy/visualboyadvance-m
The continuing development of the legendary VBA gameboy advance emulator.
Lamperougeyxy/wqmix
Code for Weighted QMIX