wang90063

I'm a Ph.D. candidate in BUPT and studying in UCD now.

Pinned Repositories

async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning
Language:Python2 3 01
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 3 00
botorch
Bayesian optimization in PyTorch
Language:Jupyter Notebook0 1 00
cnn_graph
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Language:Jupyter Notebook0 3 00
DDPG-Keras-Torcs
Using Keras and Deep Deterministic Policy Gradient to play TORCS
Language:Python1 3 00
MRO-Asyn-RL
Language:Python1 3 00
MT-MCTS
Multi-task Rl with MCTS
Language:Python2 3 00
MTRL
Three tasks
Language:Python1 3 00
RocAlphaGo
An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.
Language:Python1 3 00

wang90063's Repositories

wang90063/MT-MCTS
Multi-task Rl with MCTS
Language:Python2 3 00
wang90063/MRO-Asyn-RL
Language:Python1 3 00
wang90063/MTRL
Three tasks
Language:Python1 3 00
wang90063/RocAlphaGo
An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.
Language:Python1 3 00
wang90063/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python0 3 00
wang90063/botorch
Bayesian optimization in PyTorch
Language:Jupyter Notebook0 1 00
wang90063/codefuse
Index of the CodeFuse Repositories
0 1 00
wang90063/CS294
homework for CS294 Fall 2017
Language:Python3 0
wang90063/DDPG
Modifying the network structure in DDPG to solve the multi-agent problem
Language:Python3 0
wang90063/dlrover
DLRover: An Automatic Distributed Deep Learning System
Language:Python
wang90063/DQN
Language:Python3 0
wang90063/GA3C
Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.
Language:Python3 0
wang90063/gpytorch
A highly efficient implementation of Gaussian Processes in PyTorch
Language:Python1 0
wang90063/HEBO
Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab
Language:Jupyter Notebook
wang90063/JoshieGo
A Go playing program implemented in Tensorflow roughly according to the architecture of AlphaGo. Current strength is 3~4 amateur dan.
Language:Python3 0
wang90063/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python3 0
wang90063/ml_implementation
Implementation of Machine Learning Algorithms
Language:Python3 0
wang90063/MRO-meta
3 0
wang90063/q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
wang90063/Qwen-TensorRT-LLM
Language:Python1 0
wang90063/RLMLB
Language:Jupyter Notebook3 0
wang90063/rnn2d
CPU and GPU implementations of some 2D RNN layers
Language:C++3 0
wang90063/SafeOpt
Safe Bayesian Optimization
Language:Python2 0
wang90063/scalable_maddpg
scalable multi agents reinforcement learning
Language:Python3 0
wang90063/tensorflow-multi-dimensional-lstm
Multi dimensional LSTM as described in Alex Graves' Paper https://arxiv.org/pdf/0705.2011.pdf
Language:Python3 0
wang90063/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python1 0
wang90063/unreal
Reinforcement learning with unsupervised auxiliary tasks
Language:Python3 0
wang90063/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python1 0
wang90063/wang90063.github.io
Language:HTML3 0
wang90063/wechat_jump_game
python 微信《跳一跳》辅助
Language:Python