panyxy's Stars
tensorflow/models
Models and examples built with TensorFlow
shap/shap
A game theoretic approach to explain the output of any machine learning model.
tkipf/gcn
Implementation of Graph Convolutional Networks in TensorFlow
ibab/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
williamleif/GraphSAGE
Representation learning on large graphs using stochastic graph convolutions.
PetarV-/GAT
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
mars-project/mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
openai/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
vmayoral/basic_reinforcement_learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
shariqiqbal2810/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
eugenevinitsky/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
minqi/learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
f90/Wave-U-Net-Pytorch
Improved Wave-U-Net implemented in Pytorch
alexfrom0815/Online-3D-BPP-DRL
This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.
andrew-j-levy/Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
IC3Net/IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
TonghanWang/ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
hsvgbkhgbv/SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
madras-simulator/MADRaS
Multi-Agent DRiving Simulator
TonghanWang/NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
Sonkyunghwan/QTRAN
There will be updates later
turingaicloud/quickstart
https://tacc.ust.hk
mzho7212/LICA
[NeurIPS 2020] PyTorch implementation of "Learning Implicit Credit Assignment for Cooperative Muti-Agent Reinforcement Learning"
AnujMahajanOxf/MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
ml3705454/mapr2
QDPP-GitHub/QDPP
Multi-Agent Determinantal Q-Learning
saizhang0218/TMC
Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"
caslab-vt/SARNet
Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)