Wang-Xiaoyang
Lecturer in AI. Interests include reinforcement learning, 6G communications and signal processing.
University of Exeter
Pinned Repositories
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
Bin-Packing-Env
DIFUSCO
Code for "DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization"
labsheets
Lab worksheets for the Applied Deep Learning Course.
model_ensemble_meta_learning
pedsim_ros
ROS packages for PedSim (Pedestrian Simulator) based on social force model
planet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
PPO-Implemnetation
Implementation of PPO for CartPole-v1
RL-Implementations
Implementations for classic reinforcement learning algorithms, using Gym environment.
Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Wang-Xiaoyang's Repositories
Wang-Xiaoyang/RL-Implementations
Implementations for classic reinforcement learning algorithms, using Gym environment.
Wang-Xiaoyang/Super-mario-bros-PPO-pytorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
Wang-Xiaoyang/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
Wang-Xiaoyang/Bin-Packing-Env
Wang-Xiaoyang/DIFUSCO
Code for "DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization"
Wang-Xiaoyang/labsheets
Lab worksheets for the Applied Deep Learning Course.
Wang-Xiaoyang/model_ensemble_meta_learning
Wang-Xiaoyang/pedsim_ros
ROS packages for PedSim (Pedestrian Simulator) based on social force model
Wang-Xiaoyang/planet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
Wang-Xiaoyang/PPO-Implemnetation
Implementation of PPO for CartPole-v1
Wang-Xiaoyang/Python-workshop
Wang-Xiaoyang/resource_packing_self_play
Wang-Xiaoyang/social-lstm
Social LSTM implementation in PyTorch
Wang-Xiaoyang/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Wang-Xiaoyang/Test-on-Drone-Dataset
Wang-Xiaoyang/UARA
J. Liu, X. Tao and J. Lu, "Mobility-Aware Centralized Reinforcement Learning for Dynamic Resource Allocation in HetNets," accepted by IEEE GLOBECOM 2019.
Wang-Xiaoyang/UESTCthesis
电子科技大学毕设设计论文LaTeX模板
Wang-Xiaoyang/ug_project_power_adjustment
Wang-Xiaoyang/VAE-CVAE-MNIST
Variational Autoencoder and Conditional Variational Autoencoder on MNIST in PyTorch
Wang-Xiaoyang/Wang-Xiaoyang.github.io
Just a plain, simple and elegant one-page theme for research/academia.
Wang-Xiaoyang/workshop
demo repo
Wang-Xiaoyang/Workstation-report