panyxy

PhD student at HKUST

HKUSTHong Kong

panyxy's Stars

tensorflow/models
Models and examples built with TensorFlow
Language:Python77.2k 2.7k 7.3k45.8k
shap/shap
A game theoretic approach to explain the output of any machine learning model.
Language:Jupyter Notebook22.8k 243 2.5k3.3k
tkipf/gcn
Implementation of Graph Convolutional Networks in TensorFlow
Language:Python7.1k 157 1942k
ibab/tensorflow-wavenet
A TensorFlow implementation of DeepMind's WaveNet paper
Language:Python5.4k 264 2781.3k
williamleif/GraphSAGE
Representation learning on large graphs using stochastic graph convolutions.
Language:Python3.4k 77 166843
PetarV-/GAT
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
Language:Python3.2k 45 76645
mars-project/mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Language:Python2.7k 92 1.2k326
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
Language:Python1.9k 56 1k310
openai/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python1.6k 150 67490
vmayoral/basic_reinforcement_learning
An introductory series to Reinforcement Learning (RL) with comprehensive step-by-step tutorials.
Language:Jupyter Notebook1.1k 61 4357
shariqiqbal2810/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
Language:Python669 7 38173
eugenevinitsky/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
Language:Python387 15 128132
minqi/learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Language:Python346 16 179
f90/Wave-U-Net-Pytorch
Improved Wave-U-Net implemented in Pytorch
Language:Python308 4 1363
alexfrom0815/Online-3D-BPP-DRL
This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.
Language:Python297 7 2067
andrew-j-levy/Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
Language:Python253 11 1261
IC3Net/IC3Net
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
Language:Python210 5 1149
TonghanWang/ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
Language:Python149 4 1434
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python130 7 125
hsvgbkhgbv/SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
Language:Python112 4 1042
madras-simulator/MADRaS
Multi-Agent DRiving Simulator
Language:Python89 6 1920
TonghanWang/NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
Language:Python82 5 716
Sonkyunghwan/QTRAN
There will be updates later
Language:Python80 1 415
turingaicloud/quickstart
https://tacc.ust.hk
Language:Python75 4 96
mzho7212/LICA
[NeurIPS 2020] PyTorch implementation of "Learning Implicit Credit Assignment for Cooperative Muti-Agent Reinforcement Learning"
Language:Python58 1 614
AnujMahajanOxf/MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
Language:Python57 7 1121
ml3705454/mapr2
Language:Python43 1 114
QDPP-GitHub/QDPP
Multi-Agent Determinantal Q-Learning
Language:Jupyter Notebook41 2 28
saizhang0218/TMC
Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"
Language:Python26 2 610
caslab-vt/SARNet
Code repository for SARNet: Learning Multi-Agent Communication through Structured Attentive Reasoning (NeurIPS 2020)
Language:Python24 4 68

panyxy

panyxy's Stars

tensorflow/models

shap/shap

tkipf/gcn

ibab/tensorflow-wavenet

williamleif/GraphSAGE

PetarV-/GAT

mars-project/mars

rlworkgroup/garage

openai/maddpg

vmayoral/basic_reinforcement_learning

shariqiqbal2810/MAAC

eugenevinitsky/sequential_social_dilemma_games

minqi/learning-to-communicate-pytorch

f90/Wave-U-Net-Pytorch

alexfrom0815/Online-3D-BPP-DRL

andrew-j-levy/Hierarchical-Actor-Critc-HAC-

IC3Net/IC3Net

TonghanWang/ROMA

wwxFromTju/deepmind_MAS_enviroment

hsvgbkhgbv/SQDDPG

madras-simulator/MADRaS

TonghanWang/NDQ

Sonkyunghwan/QTRAN

turingaicloud/quickstart

mzho7212/LICA

AnujMahajanOxf/MAVEN

ml3705454/mapr2

QDPP-GitHub/QDPP

saizhang0218/TMC

caslab-vt/SARNet