timefly-1989

Pinned Repositories

abstreet
A traffic simulation game exploring how small changes to roads affect cyclists, transit users, pedestrians, and drivers.
Language:Rust00
alignment
ELIGN: Expectation Alignment as a Multi-agent Intrinsic Reward
Language:Python0 0 00
Anarcho_AV---Multi-agent-RL-for-AV-traffic-clearance
Language:Python00
ATOC
an implementation of ATOC
Language:Python0 0 01
atoc_coma
Language:Python0 0 00
ATOC_COMA_PyTorch
Language:Python00
AttA2C
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
Language:Jupyter Notebook00
Attention-Actor-Critic-for-Atari-Learning
A Reinforcement Learning algorithm with attention mechanism for Atari Learning
Language:Python00
Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
Language:Python0 0 00
DRQN-pytorch
[PYTORCH] Simple implementation of DQN, DRQN, A3C, PPO in Atari Breakout
Language:Python2 0 02

timefly-1989's Repositories

timefly-1989/DRQN-pytorch
[PYTORCH] Simple implementation of DQN, DRQN, A3C, PPO in Atari Breakout
Language:Python2 0 02
timefly-1989/ATOC
an implementation of ATOC
Language:Python0 0 01
timefly-1989/deep-MARL-papers
[WIP✏] Paper list of deep multi-agent reinforcement learning (deep MARL)
timefly-1989/DeepLearning-500-questions
深度学习500问，以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述，以帮助自己及有需要的读者。全书分为18个章节，50余万字。由于水平有限，书中不妥之处恳请广大读者批评指正。未完待续............ 如有意合作，联系scutjy2015@163.com 版权所有，违权必究 Tan 2018.06
timefly-1989/deeplearningbook-chinese
Deep Learning Book Chinese Translation
timefly-1989/DIRAL
Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication
timefly-1989/DSN
This repository is a pure environment of DSNs.
timefly-1989/DynEnv
Dynamic Simulation Environments for Reinforcement Learning
timefly-1989/epciclr2020
timefly-1989/flock_env
Boid flock multi-agent RL training environment
timefly-1989/FQA
Multi-agent Trajectory Prediction with Fuzzy Query Attention
timefly-1989/gym-pybullet-drones
PyBullet Gym environments for single and multi-agent reinforcement learning of quadcopter control
timefly-1989/hanabi_SAD
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
timefly-1989/HiT-MAC
This repository is the official implementation of Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor Networks.
timefly-1989/IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
timefly-1989/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
timefly-1989/MAAC-1
timefly-1989/MAProj
Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment
timefly-1989/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
timefly-1989/mdde-MAAC
Actor-Attention-Critic for Multi-Agent Reinforcement Learning for Multi-agent Data Distribution Environment
timefly-1989/Multi_UAV
Language:Python
timefly-1989/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
timefly-1989/PARL
A high-performance distributed training framework for Reinforcement Learning
timefly-1989/policy_based_RL
The implement of the policy gradient RL algorithm with pytorch
timefly-1989/pyGAT
Pytorch implementation of the Graph Attention Network model by Veličković et. al (2017, https://arxiv.org/abs/1710.10903)
timefly-1989/pygcn
Graph Convolutional Networks in PyTorch
timefly-1989/StarCraft
Implementations of QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
timefly-1989/Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
timefly-1989/VBC
pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"
timefly-1989/webgs
LAR-19641-1: WebGS: Web-based Platform for Multi-UAV Flight Visualization and Simulation