Pinned Repositories
Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Discrete-SAC-PyTorch
PyTorch implementation of discrete version of Soft Actor-Critic.
Distributional-RL
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
DQN-HER
Implementation of the hindsight experience by DQN algorithm on the bit flip environment.
NN-Without-Frameworks
Let's build Neural Networks from scratch.
PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
Rainbow
Combining Improvements in Deep Reinforcement Learning
SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
alirezakazemipour's Repositories
alirezakazemipour/DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
alirezakazemipour/DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
alirezakazemipour/PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
alirezakazemipour/SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
alirezakazemipour/NN-Without-Frameworks
Let's build Neural Networks from scratch.
alirezakazemipour/Distributional-RL
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
alirezakazemipour/DQN-HER
Implementation of the hindsight experience by DQN algorithm on the bit flip environment.
alirezakazemipour/Rainbow
Combining Improvements in Deep Reinforcement Learning
alirezakazemipour/A3C-ACER-PyTorch
Implementation of ACER and A3C in PyTorch.
alirezakazemipour/Cycle-GAN-PyTorch
PyTorch implementation of the Cycle GAN paper.
alirezakazemipour/DeepRL-Paradise
Comprehensive Deep RL Implementations
alirezakazemipour/TRPO-PyTorch
Trust Region Policy Optimization in PyTorch.
alirezakazemipour/ACKTR-PyTorch
alirezakazemipour/DDQN-Random-Network-Distillation
alirezakazemipour/Parkinson-Disease-Classification
alirezakazemipour/TD3-PyTorch
Addressing Function Approximation Error in Actor-Critic Methods
alirezakazemipour/alirezakazemipour
alirezakazemipour/alirezakazemipour.github.io
alirezakazemipour/bandit-algorithms
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
alirezakazemipour/brett-daley.github.io
alirezakazemipour/Cartpole-RL
alirezakazemipour/Discrete-PPO
Implementation of the proximal policy optimization on the Atari environments.
alirezakazemipour/gymnax
RL Environments in JAX 🌍
alirezakazemipour/homework_fall2021
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)
alirezakazemipour/mon_mdp_neurips24
alirezakazemipour/PyExpUtils
Experiment utility code, specifically designed for use with Compute Canada.
alirezakazemipour/reinforcement_learning_an_introduction
Notes and exercise solutions for second edition of Sutton & Barto's book
alirezakazemipour/rl-prediction-template
alirezakazemipour/Top-50-Crypto-Kaggle
alirezakazemipour/TreeBased-And-SVM-Classifiers