Pinned Repositories
async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
botorch
Bayesian optimization in PyTorch
cnn_graph
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
DDPG-Keras-Torcs
Using Keras and Deep Deterministic Policy Gradient to play TORCS
MRO-Asyn-RL
MT-MCTS
Multi-task RL with MCTS
MTRL
Three tasks
RocAlphaGo
An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.
wang90063's Repositories
wang90063/MT-MCTS
Multi-task RL with MCTS
wang90063/MRO-Asyn-RL
wang90063/MTRL
Three tasks
wang90063/RocAlphaGo
An independent, student-led replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search" (Nature 529, 484-489, 28 Jan 2016), details of which can be found on their website https://deepmind.com/publications.html.
wang90063/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
wang90063/botorch
Bayesian optimization in PyTorch
wang90063/codefuse
Index of the CodeFuse Repositories
wang90063/CS294
Homework for CS294, Fall 2017
wang90063/DDPG
Modifying the network structure in DDPG to solve the multi-agent problem
wang90063/dlrover
DLRover: An Automatic Distributed Deep Learning System
wang90063/DQN
wang90063/GA3C
Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.
wang90063/gpytorch
A highly efficient implementation of Gaussian Processes in PyTorch
wang90063/HEBO
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
wang90063/JoshieGo
A Go-playing program implemented in TensorFlow, roughly following the architecture of AlphaGo. Current strength is 3-4 amateur dan.
wang90063/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
wang90063/ml_implementation
Implementation of Machine Learning Algorithms
wang90063/MRO-meta
wang90063/q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
wang90063/Qwen-TensorRT-LLM
wang90063/RLMLB
wang90063/rnn2d
CPU and GPU implementations of some 2D RNN layers
wang90063/SafeOpt
Safe Bayesian Optimization
wang90063/scalable_maddpg
Scalable multi-agent reinforcement learning
wang90063/tensorflow-multi-dimensional-lstm
Multi-dimensional LSTM as described in Alex Graves' paper https://arxiv.org/pdf/0705.2011.pdf
wang90063/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
wang90063/unreal
Reinforcement learning with unsupervised auxiliary tasks
wang90063/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
wang90063/wang90063.github.io
wang90063/wechat_jump_game
Python helper for the WeChat game 跳一跳 ("Jump Jump")