Pinned Repositories
3D-bin-packing
Solving the 3D bin packing problem with reinforcement learning
aop
Codebase for Adaptive Online Planning
berkeley-deep-RL-pytorch-solutions
Pytorch solutions for UC Berkeley's cs285 assignments
Collab_DRL
Collab_PyTorch_DRL
Material for a short tutorial about PyTorch and Stable Baselines
CuriousAgents
This repo explores ideas that can maybe in future be used to scale reinforcement learning to larger real world problems.
design-bench-mirror
marianophielipp.github.io
webapge
Mish
Mish: A Self Regularized Non-Monotonic Neural Activation Function
marianophielipp's Repositories
marianophielipp/3D-bin-packing
Solving the 3D bin packing problem with reinforcement learning
marianophielipp/aop
Codebase for Adaptive Online Planning
marianophielipp/berkeley-deep-RL-pytorch-solutions
Pytorch solutions for UC Berkeley's cs285 assignments
marianophielipp/Collab_DRL
marianophielipp/Collab_PyTorch_DRL
Material for a short tutorial about PyTorch and Stable Baselines
marianophielipp/CuriousAgents
This repo explores ideas that can maybe in future be used to scale reinforcement learning to larger real world problems.
marianophielipp/design-bench-mirror
marianophielipp/marianophielipp.github.io
webapge
marianophielipp/Mish
Mish: A Self Regularized Non-Monotonic Neural Activation Function
marianophielipp/Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
marianophielipp/practicals-2019
Practical notebooks for Khipu 2019, held in Universidad de la República in Montevideo.
marianophielipp/PyTorch
marianophielipp/PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
marianophielipp/Reinforcement_learning_tutorial_with_demo
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc..
marianophielipp/RL-Adventure-2
PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay
marianophielipp/rl-tutorial-jnrr19
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
marianophielipp/robotics-rl-srl
S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics
marianophielipp/sau-explore
Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"
marianophielipp/website