Pinned Repositories
Display-Controler-LIBtft144
In this repo I improve the performance of the LIBtft144 controller using NumPy and vectorized operations, and use it to show the image stream from my Raspberry Pi camera in real time on an SPI 144 display.
DRL-DQN-Deep_Q_learning
DRL-HC-methods
In this repo I explore improvements to hill climbing, such as adaptive noise scaling and the cross-entropy method, and use them to solve the CartPole-v0 environment from OpenAI Gym.
DRL-MC-control
DRL-MC-estimation
In this repo I use Monte Carlo methods to estimate the value of the different (state, action) pairs in a game of blackjack, given a heuristic.
DRL-ND-Project_1
This repo contains the solution for the first project of Udacity's Deep Reinforcement Learning Nanodegree.
DRL-ND-Project_2
DRL-ND-Project_3
DRL-TD-methods
In this repo I explore the Sarsa, Sarsamax, and Expected Sarsa methods for solving RL tasks.
stacked_capsule_autoencoders
Forked from the google-research repo.
Victor-Martinez-Pozos's Repositories
Victor-Martinez-Pozos/stacked_capsule_autoencoders
Forked from the google-research repo.
Victor-Martinez-Pozos/DRL-HC-methods
In this repo I explore improvements to hill climbing, such as adaptive noise scaling and the cross-entropy method, and use them to solve the CartPole-v0 environment from OpenAI Gym.
Victor-Martinez-Pozos/Display-Controler-LIBtft144
In this repo I improve the performance of the LIBtft144 controller using NumPy and vectorized operations, and use it to show the image stream from my Raspberry Pi camera in real time on an SPI 144 display.
Victor-Martinez-Pozos/DRL-DQN-Deep_Q_learning
Victor-Martinez-Pozos/DRL-MC-control
Victor-Martinez-Pozos/DRL-MC-estimation
In this repo I use Monte Carlo methods to estimate the value of the different (state, action) pairs in a game of blackjack, given a heuristic.
Victor-Martinez-Pozos/DRL-ND-Project_1
This repo contains the solution for the first project of Udacity's Deep Reinforcement Learning Nanodegree.
Victor-Martinez-Pozos/DRL-ND-Project_2
Victor-Martinez-Pozos/DRL-ND-Project_3
Victor-Martinez-Pozos/DRL-TD-methods
In this repo I explore the Sarsa, Sarsamax, and Expected Sarsa methods for solving RL tasks.
Victor-Martinez-Pozos/DRL-PG-PPO
Victor-Martinez-Pozos/DRL-PG-reinforce
Victor-Martinez-Pozos/DRL-Task-OpenAI_Gym_Taxi-v2
Victor-Martinez-Pozos/DRL-TD-Continuous_spaces
Victor-Martinez-Pozos/kaggle-Flower-Classification-with-TPUs
Victor-Martinez-Pozos/ShuffleNet-Series