alirezakazemipour

MSc in CS

University of AlbertaEdmonton, AB

Pinned Repositories

Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
Language:Python27 2 23
DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
Language:Python91 2 518
DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Language:Python62 2 424
Discrete-SAC-PyTorch
PyTorch implementation of discrete version of Soft Actor-Critic.
Language:Python31 3 16
Distributional-RL
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
Language:Python9 1 02
DQN-HER
Implementation of the hindsight experience by DQN algorithm on the bit flip environment.
Language:Python6 3 01
NN-Without-Frameworks
Let's build Neural Networks from scratch.
Language:Python13 2 02
PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
Language:Python44 2 28
Rainbow
Combining Improvements in Deep Reinforcement Learning
Language:Python6 2 10
SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
Language:Python22 2 04

alirezakazemipour's Repositories

alirezakazemipour/DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
Language:Python91 2 518
alirezakazemipour/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Language:Python62 2 424
alirezakazemipour/PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
Language:Python44 2 28
alirezakazemipour/SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
Language:Python22 2 04
alirezakazemipour/NN-Without-Frameworks
Let's build Neural Networks from scratch.
Language:Python13 2 02
alirezakazemipour/Distributional-RL
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
Language:Python9 1 02
alirezakazemipour/DQN-HER
Implementation of the hindsight experience by DQN algorithm on the bit flip environment.
Language:Python6 3 01
alirezakazemipour/Rainbow
Combining Improvements in Deep Reinforcement Learning
Language:Python6 2 10
alirezakazemipour/A3C-ACER-PyTorch
Implementation of ACER and A3C in PyTorch.
Language:Python4 2 02
alirezakazemipour/Cycle-GAN-PyTorch
PyTorch implementation of the Cycle GAN paper.
Language:Python4 2 00
alirezakazemipour/DeepRL-Paradise
Comprehensive Deep RL Implementations
3 1 01
alirezakazemipour/TRPO-PyTorch
Trust Region Policy Optimization in PyTorch.
Language:Python2 2 0
alirezakazemipour/ACKTR-PyTorch
Language:Python1 1 01
alirezakazemipour/DDQN-Random-Network-Distillation
Language:Python1 2 0
alirezakazemipour/Parkinson-Disease-Classification
Language:Jupyter Notebook1 1 0
alirezakazemipour/TD3-PyTorch
Addressing Function Approximation Error in Actor-Critic Methods
Language:Python1 2 01
alirezakazemipour/alirezakazemipour
1 0
alirezakazemipour/alirezakazemipour.github.io
Language:JavaScript2 0
alirezakazemipour/bandit-algorithms
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
Language:Python0 0
alirezakazemipour/brett-daley.github.io
Language:HTML0 0
alirezakazemipour/Cartpole-RL
Language:Python2 0
alirezakazemipour/Discrete-PPO
Implementation of the proximal policy optimization on the Atari environments.
Language:Python2 01
alirezakazemipour/gymnax
RL Environments in JAX 🌍
Language:Python0 0
alirezakazemipour/homework_fall2021
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)
Language:Python0 0
alirezakazemipour/mon_mdp_neurips24
Language:Python0 0
alirezakazemipour/PyExpUtils
Experiment utility code, specifically designed for use with Compute Canada.
Language:Python0 0
alirezakazemipour/reinforcement_learning_an_introduction
Notes and exercise solutions for second edition of Sutton & Barto's book
Language:TeX0 0
alirezakazemipour/rl-prediction-template
Language:Python0 0
alirezakazemipour/Top-50-Crypto-Kaggle
Language:Python1 0
alirezakazemipour/TreeBased-And-SVM-Classifiers
Language:Jupyter Notebook1 0