Victor-Martinez-Pozos

MachinaMexico City, Mexico

Pinned Repositories

Display-Controler-LIBtft144
In this repo I improve the performance of the LIBtft144 controller using numpy and vector operations, and use it to show in real-time the image stream from my raspberry-pi camera on a SPI 144 display
Language:Python0 0 00
DRL-DQN-Deep_Q_learning
Language:Jupyter Notebook00
DRL-HC-methods
In this repo I explore the Hill Climbing improvements like adaptative nopise scaling and cross-entropy to use them to solve the enviroment CartPole-v0 from OpenAI-GYM.
Language:Jupyter Notebook1 1 00
DRL-MC-control
Language:Jupyter Notebook00
DRL-MC-estimation
In this repo I use the Monte Carlo methods to estimate the value of the different (action, states) pairs in a black jack game, given an heuristic.
Language:Jupyter Notebook00
DRL-ND-Project_1
This repo contains the solution for the first project of the deep reinforcement learning nano degree from Udacity.
Language:Jupyter Notebook00
DRL-ND-Project_2
Language:Jupyter Notebook00
DRL-ND-Project_3
Language:Jupyter Notebook00
DRL-TD-methods
In this repo I explore the sarsa, sarsa max, and expected sarsa methods to solve RL tasks.
Language:Jupyter Notebook00
stacked_capsule_autoencoders
Forked project from google-research repo
Language:Python22

Victor-Martinez-Pozos's Repositories

Victor-Martinez-Pozos/stacked_capsule_autoencoders
Forked project from google-research repo
Language:Python22
Victor-Martinez-Pozos/DRL-HC-methods
In this repo I explore the Hill Climbing improvements like adaptative nopise scaling and cross-entropy to use them to solve the enviroment CartPole-v0 from OpenAI-GYM.
Language:Jupyter Notebook1 1 00
Victor-Martinez-Pozos/Display-Controler-LIBtft144
In this repo I improve the performance of the LIBtft144 controller using numpy and vector operations, and use it to show in real-time the image stream from my raspberry-pi camera on a SPI 144 display
Language:Python0 0 00
Victor-Martinez-Pozos/DRL-DQN-Deep_Q_learning
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-MC-control
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-MC-estimation
In this repo I use the Monte Carlo methods to estimate the value of the different (action, states) pairs in a black jack game, given an heuristic.
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-ND-Project_1
This repo contains the solution for the first project of the deep reinforcement learning nano degree from Udacity.
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-ND-Project_2
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-ND-Project_3
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-TD-methods
In this repo I explore the sarsa, sarsa max, and expected sarsa methods to solve RL tasks.
Language:Jupyter Notebook00
Victor-Martinez-Pozos/DRL-PG-PPO
Language:Jupyter Notebook
Victor-Martinez-Pozos/DRL-PG-reinforce
Language:Jupyter Notebook
Victor-Martinez-Pozos/DRL-Task-OpenAI_Gym_Taxi-v2
Language:Python
Victor-Martinez-Pozos/DRL-TD-Continuous_spaces
Language:Jupyter Notebook1 0
Victor-Martinez-Pozos/kaggle-Flower-Classification-with-TPUs
Language:Jupyter Notebook1 0
Victor-Martinez-Pozos/ShuffleNet-Series

Victor-Martinez-Pozos

Pinned Repositories

Display-Controler-LIBtft144

DRL-DQN-Deep_Q_learning

DRL-HC-methods

DRL-MC-control

DRL-MC-estimation

DRL-ND-Project_1

DRL-ND-Project_2

DRL-ND-Project_3

DRL-TD-methods

stacked_capsule_autoencoders

Victor-Martinez-Pozos's Repositories

Victor-Martinez-Pozos/stacked_capsule_autoencoders

Victor-Martinez-Pozos/DRL-HC-methods

Victor-Martinez-Pozos/Display-Controler-LIBtft144

Victor-Martinez-Pozos/DRL-DQN-Deep_Q_learning

Victor-Martinez-Pozos/DRL-MC-control

Victor-Martinez-Pozos/DRL-MC-estimation

Victor-Martinez-Pozos/DRL-ND-Project_1

Victor-Martinez-Pozos/DRL-ND-Project_2

Victor-Martinez-Pozos/DRL-ND-Project_3

Victor-Martinez-Pozos/DRL-TD-methods

Victor-Martinez-Pozos/DRL-PG-PPO

Victor-Martinez-Pozos/DRL-PG-reinforce

Victor-Martinez-Pozos/DRL-Task-OpenAI_Gym_Taxi-v2

Victor-Martinez-Pozos/DRL-TD-Continuous_spaces

Victor-Martinez-Pozos/kaggle-Flower-Classification-with-TPUs

Victor-Martinez-Pozos/ShuffleNet-Series