Michaelrising

Ph.D. Candidate at CityU @ HK

Hong Kong

Pinned Repositories

atari-reset
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
Language:Python00
awesome-causality-algorithms
An index of algorithms for learning causality with data
00
causal-rl
Language:Python00
Citadel-Datathon
Language:Jupyter Notebook00
contrib
Implementations of ideas from recent papers
Language:Python00
DesignCTPB
Language:R10
Engineering-optimization
Language:Python20
PPOProstateCancer
This project applies PPO to individulize the treatment policy for locally advanced prpstate cancer patients.
Language:HTML20
Prog-RL
Language:Python1 1 01
tfSSSD
Tensorflow version of SSSD
Language:Python1 1 00

Michaelrising's Repositories

Michaelrising/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Michaelrising/SCMMAB-NIPS2018
Structural Causal Bandit
Michaelrising/sac-discrete.pytorch
A PyTorch implementation of SAC-Discrete.
Language:Python
Michaelrising/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Michaelrising/N-gram-model-for-Hangman-game
Use different orders of N-gram model to play Hangman game.
Michaelrising/numba-scipy
numba_scipy extends Numba to make it aware of SciPy
Michaelrising/jax-rl
Jax (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Michaelrising/counterfactual-diagnosis
Michaelrising/DeepRL-Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Michaelrising/Time2Vec-PyTorch
Reproducing the paper: "Time2Vec: Learning a Vector Representation of Time" - https://arxiv.org/pdf/1907.05321.pdf
Michaelrising/xitorch
Differentiable scientific computing library
Michaelrising/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Michaelrising/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
Michaelrising/contrib
Implementations of ideas from recent papers
Michaelrising/DesignCTPB
Language:R1
Michaelrising/gail-airl-ppo.pytorch
A PyTorch implementation of GAIL and AIRL based on PPO.
Michaelrising/GraphEmbedding
Implementation and experiments of graph embedding algorithms.
Michaelrising/tf-diffwave
Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis
Michaelrising/datasciencecoursera
for Data Science class on Coursera
Michaelrising/MaxUrbtixBusyBot
redirect to www.urbtix.hk
Michaelrising/mle-for-population-params-binomial
Matlab codes for MLE for learning populations of parameters
Michaelrising/DRL
Deconfounding Reinforcement Learning in Observational Settings
Michaelrising/mazelab
A customizable framework to create maze and gridworld environments
Michaelrising/Integrating-evolutionary-dynamics-into-treatment-of-mCRPC
Michaelrising/atari-reset
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
Michaelrising/gym-soccer
Michaelrising/pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
Michaelrising/Stock-Time-Series-Analysis
Mathematical modeling for finantial time series data
Michaelrising/cuda-floyd_warshall
CUDA implementation of the Blocked Floyd Warshall All pairs shortest path graph algorithm
Michaelrising/urbtix
buy ticket