Lamperougeyxy

Beijing Jiaotong University

Pinned Repositories

aamas_19
Source code for the paper "Online Abstraction with MDP Homomorphisms for Deep Learning".
Language:Python0 1 00
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
Language:Python0 1 00
ai-deadlines
:alarm_clock: AI conference deadline countdowns
Language:HTML0 1 00
AlphaGOZero-python-tensorflow
Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)
Language:Python0 1 00
anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Language:HTML0 1 00
Arcade-Learning-Environment
The Arcade Learning Environment (ALE) -- a platform for AI research.
Language:C++0 1 00
ArraySortAlgorithm
各个排序算法
Language:Java0 1 00
GHQ
Official simplified implementation for "GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning"
Language:Python10
Synthetic-PandoraHearts-Jack
Language:JavaScript1 2 00
TStarBots
1 1 00

Lamperougeyxy's Repositories

Lamperougeyxy/GHQ
Official simplified implementation for "GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning"
Language:Python10
Lamperougeyxy/Synthetic-PandoraHearts-Jack
Language:JavaScript1 2 00
Lamperougeyxy/ai-deadlines
:alarm_clock: AI conference deadline countdowns
Language:HTML0 1 00
Lamperougeyxy/anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Language:HTML0 1 00
Lamperougeyxy/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
Language:Python1 0
Lamperougeyxy/CityFlow
A Multi-Agent Reinforcement Learning Environment for Large Scale City Traffic Scenario
Language:C++0 0
Lamperougeyxy/DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
Language:Python1 0
Lamperougeyxy/facmac
Language:Python1 0
Lamperougeyxy/FinRL
FinRL: Financial Reinforcement Learning. 🔥
Language:Jupyter Notebook1 0
Lamperougeyxy/gem
Language:Python1 0
Lamperougeyxy/go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
Language:Python1 0
Lamperougeyxy/gps
Guided Policy Search
Language:Python1 0
Lamperougeyxy/handful-of-trials
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
Language:Python1 0
Lamperougeyxy/Homophily-MARL
Code for "Learning Homophilic Incentives in Sequential Social Dilemmas"
Language:Python1 0
Lamperougeyxy/LESSON
Language:Python1 0
Lamperougeyxy/MADDPG-1
Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
Language:Python1 0
Lamperougeyxy/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
Language:Python1 0
Lamperougeyxy/MPE-Multiagent-RL-Algos
Simple verification experiments codes for multi-agent RL using OpenAI MPE environment
Language:Python1 0
Lamperougeyxy/multi-agent-PPO-on-SMAC
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
Language:Python1 0
Lamperougeyxy/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python1 0
Lamperougeyxy/Offline-Pre-trained-Multi-Agent-Decision-Transformer
Language:Python1 0
Lamperougeyxy/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python1 0
Lamperougeyxy/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language:Python1 0
Lamperougeyxy/QPLEX
Language:Python1 0
Lamperougeyxy/smac_plus
An open source benchmark for Multi Agent Reinforcement Learning
Language:Python1 0
Lamperougeyxy/smacv2
Language:Python1 0
Lamperougeyxy/Synthetic-PandoraHearts
Language:JavaScript2 0
Lamperougeyxy/TRPO-in-MARL
Language:Python1 0
Lamperougeyxy/visualboyadvance-m
The continuing development of the legendary VBA gameboy advance emulator.
Language:C++1 0
Lamperougeyxy/wqmix
Code for Weighted QMIX
Language:Python1 0