nric

Pinned Repositories

A2C_TF2.0_Keras_LunarLander
This is a TF2.0 Keras implementation of a A2C agent (tested for openai lunar lander v2)
Language:Python4 1 00
LSTM-Autoencoder-for-rare-events
An LSTM (binary classifier) Autoencoder to identify rare events written using TF2.0/Keras
Language:Jupyter Notebook41
LSTM-Cell-using-Numpy
An implementation of an LSTM cell puerly written using Numpy (and scipy).
Language:Python2 1 00
LunarLanderDuelingDQN
A minimalistic tensorflow.keras (tf2.0) implentaion of a dueling DQN to solve the LunarLander-v2 gym env.
Language:Python1 1 01
MonteCarloGridworld
This is my implementaion of a Monte Carlo Tree Search ML solution to gridworld. Part of Move37 - Capter 3-4. Credits go to LazyProgrammer
Language:Python10
ProximalPolicyOptimizationContinuousKeras
This is an Tensorflow 2.0 (Keras) implementation of a Open Ai's proximal policy optimization PPO algorithem for continuous action spaces.
Language:Python63
ProximalPolicyOptimizationKeras
This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.
Language:Python11 3 37
TF2.0_Keras_DDPG
A commented Tensorflow 2.0 Keras implementation of DDPG for open AI gym continuous environments.
Language:Python4 1 16
VAE_CNN_Keras_TF2.0
Two different implementations of a Variational Autoencoder VAE with convolutional Neural networks via Tesorflow 2.0/Keras.
Language:Python11
VanillaPolicyGradientAlgorithm
This is an Tensorflow.Keras (TF 2.0) based implementation of a vanilla Policy Gradient learner to solve OpenAi Gym's Cartpole.
Language:Python2 1 01

nric's Repositories

nric/ProximalPolicyOptimizationKeras
This is a deterministic Tensorflow 2.0 (keras) implementation of a Open Ai's proximal policy optimization actor critic algorithm PPO.
Language:Python11 3 37
nric/ProximalPolicyOptimizationContinuousKeras
This is an Tensorflow 2.0 (Keras) implementation of a Open Ai's proximal policy optimization PPO algorithem for continuous action spaces.
Language:Python63
nric/A2C_TF2.0_Keras_LunarLander
This is a TF2.0 Keras implementation of a A2C agent (tested for openai lunar lander v2)
Language:Python4 1 00
nric/LSTM-Autoencoder-for-rare-events
An LSTM (binary classifier) Autoencoder to identify rare events written using TF2.0/Keras
Language:Jupyter Notebook41
nric/TF2.0_Keras_DDPG
A commented Tensorflow 2.0 Keras implementation of DDPG for open AI gym continuous environments.
Language:Python4 1 16
nric/LSTM-Cell-using-Numpy
An implementation of an LSTM cell puerly written using Numpy (and scipy).
Language:Python2 1 00
nric/VanillaPolicyGradientAlgorithm
This is an Tensorflow.Keras (TF 2.0) based implementation of a vanilla Policy Gradient learner to solve OpenAi Gym's Cartpole.
Language:Python2 1 01
nric/LunarLanderDuelingDQN
A minimalistic tensorflow.keras (tf2.0) implentaion of a dueling DQN to solve the LunarLander-v2 gym env.
Language:Python1 1 01
nric/MonteCarloGridworld
This is my implementaion of a Monte Carlo Tree Search ML solution to gridworld. Part of Move37 - Capter 3-4. Credits go to LazyProgrammer
Language:Python10
nric/VAE_CNN_Keras_TF2.0
Two different implementations of a Variational Autoencoder VAE with convolutional Neural networks via Tesorflow 2.0/Keras.
Language:Python11
nric/A3C_Atari
This is an implementaion of an asyncronous advantage actor critic A3C algorithm to play open ai gym atari games.
Language:Python00
nric/AugmentedRandomSearchGym
An implementation for Augmented Radom Search algorithm solivng 2d open ai gym enviroments - tested for Box2d envs.
Language:Python
nric/AugmentedRandomSearchGymBipedalWalker
Augmented Random Search was chosen to make the Open AI Bipedal Walker walk. However, the Agent is general. It could be used for other gym environments as well without change. But probably hyper paramter would require adaptation.
Language:Jupyter Notebook1 0
nric/CartpoleSimpleDQN
This is a simple implementation of a Deep Q Network learning agent tested on Open AI Gym's cart pole.
Language:Jupyter Notebook
nric/Coursera_Capstone
Capstone Project for IBM Data Science Certificate
Language:Jupyter Notebook
nric/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language:Jupyter Notebook
nric/DeepQLeaningAtari
This is an TensorFlow Keras (TF2) implementaion of a vanilla Deep Q Leaning algorithm to play the OpenAI Gym Atari Games.
Language:Python
nric/DuelingDQNDoom
This is an implementation of a Dueling Deep Q Learning agent that learns to play Doom.
Language:Python
nric/GridworldQLearning
This is a solution to Gridworld using off policy Q Learning. As enviroment it emplys LazyProgrammers Grid_world.py https://github.com/lazyprogrammer/machine_learning_examples/tree/master/rl Solution is part of The School of AI's Move 37 Course https://www.theschool.ai/courses/move-37-course/ Written as a jupyter cell in visual studio code. Just run the cell. If you want to run using python interpreter directly, replace def main(): to if name == 'main': and remove the last line (call of main()).
Language:Python
nric/MonteCarloFrozenLake
A solution of Open Ai Gym FrozenLake-v0 using Monte Carlo first visit method in Python 3.6.7
Language:Python
nric/MonteCarloPolicyGradientAgent
This is a Monte Carlo Policy Gradient algorithm (somewhat) written using TF2.0 keras optimized to solve Open Ai Gym Lunar Lander.
Language:Python1 0
nric/MPG_DataAnalysis_With_TF2.0
A test project for some data anysis and a simple Neural Network (2 fully connected layers) written with Tensor Flow 2.0 to predict consumption of a vehicle.
Language:Jupyter Notebook1 0
nric/NeuroEvolutionLunarLander
This is a my minimalisic solution to the Lular Lander gym environment using an evolutionaly Neural Net aproach.
Language:Python
nric/performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
nric/QLearningTaxiV2
A Q Learning Agent solving the OpenAI Gym TaxiV2 environment. Despite the Hyper parameters, the agent should be able to solve all gym toy text enviroments as it is written very general. Run code using a Jupyter Notebook or VS Code connected to a jupyter server.
Language:Python1 0