Pinned Repositories
aamas_19
Source code for the paper "Online Abstraction with MDP Homomorphisms for Deep Learning".
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
ai-deadlines
:alarm_clock: AI conference deadline countdowns
AlphaGOZero-python-tensorflow
Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)
anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Arcade-Learning-Environment
The Arcade Learning Environment (ALE) -- a platform for AI research.
ArraySortAlgorithm
各个排序算法
atari-py
An `openai/atari-py` fork with Windows support and removed zlib/libpng dependencies. Binaries (wheels) are on "Releases" tab.
Synthetic-PandoraHearts-Jack
TStarBots
Lamperougeyxy's Repositories
Lamperougeyxy/bert
TensorFlow code and pre-trained models for BERT
Lamperougeyxy/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Lamperougeyxy/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Lamperougeyxy/deeprl_network
multi-agent deep reinforcement learning for networked system control.
Lamperougeyxy/deeprl_signal_control
multi-agent deep reinforcement learning for large-scale traffic signal control.
Lamperougeyxy/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
Lamperougeyxy/dreamer-1
Dream to Control: Learning Behaviors by Latent Imagination
Lamperougeyxy/EITI-EDTI
Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)
Lamperougeyxy/ghostnet
[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"
Lamperougeyxy/ghostnet.pytorch
[CVPR2020] Surpassing MobileNetV3: "GhostNet: More Features from Cheap Operations"
Lamperougeyxy/hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
Lamperougeyxy/jmlr-style-file
LaTeX style file for the Journal of Machine Learning Research
Lamperougeyxy/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Lamperougeyxy/MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
Lamperougeyxy/mentalRL
Code for our AAMAS 2020 paper: "A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry".
Lamperougeyxy/MPHRL
Model Primitive Hierarchical Reinforcement Learning
Lamperougeyxy/NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
Lamperougeyxy/pymoo
NSGA2, NSGA3, R-NSGA3, MOEAD, GA, DE,
Lamperougeyxy/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Lamperougeyxy/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Lamperougeyxy/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Lamperougeyxy/Reinforcement-Learning-from-Hierarchical-Critics
Reinforcement Learning from Hierarchical Critics
Lamperougeyxy/RL-Papers
papers about reinforcement learning
Lamperougeyxy/ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
Lamperougeyxy/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Lamperougeyxy/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Lamperougeyxy/UnsupervisedAttentionMechanism
Code for our paper: "Unsupervised Attention Mechanism across Neural Network Layers".
Lamperougeyxy/VAE-Pytorch
Lamperougeyxy/vscode-rainbow-fart
一个在你编程时疯狂称赞你的 VSCode 扩展插件 | An VSCode extension that keeps giving you compliment while you are coding, it will checks the keywords of code to play suitable sounds.
Lamperougeyxy/ZOOpt
A python package of Zeroth-Order Optimization (ZOOpt)