mabirck
CS undergrad at UFPel (Federal University of Pelotas). Deep and reinforcement learning undergrad researcher at @ufpeldatalab.
@ufpeldatalab Pelotas-RS, Brazil
Pinned Repositories
adaptative-dropout-pytorch
PyTorch implementation of Adaptive Dropout, a.k.a. Standout.
and-nd-firebase
Course code repository for Firebase in a Weekend by Google: Android
AndroidUdacity
Udacity course, implementations and code.
Atari-WGAN
Implementation of WGAN for generating Atari game images. (GAN, WGAN, ATARI, Generative)
AttentionTRL
Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind).
Berkeley_CS188
Repository containing solutions for the Berkeley CS188 class.
CatastrophicForgetting-EWC
#WORK IN PROGRESS: PyTorch implementation of supervised and deep Q-learning EWC (Elastic Weight Consolidation), introduced in "Overcoming Catastrophic Forgetting in Neural Networks"
Geo_Classifier_siamese_NN
Generic geolocation classifier using Siamese neural networks, built with TensorFlow and Keras.
joint_ppo_pytorch
Extension of https://github.com/ikostrikov/pytorch-a2c-ppo-acktr, making it feasible to train on multiple games simultaneously.
Maml_Reptile_PyTorch
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" in PyTorch
mabirck's Repositories
mabirck/CatastrophicForgetting-EWC
#WORK IN PROGRESS: PyTorch implementation of supervised and deep Q-learning EWC (Elastic Weight Consolidation), introduced in "Overcoming Catastrophic Forgetting in Neural Networks"
mabirck/adaptative-dropout-pytorch
PyTorch implementation of Adaptive Dropout, a.k.a. Standout.
mabirck/AttentionTRL
Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind).
mabirck/Maml_Reptile_PyTorch
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" in PyTorch
mabirck/Atari-WGAN
Implementation of WGAN for generating Atari game images. (GAN, WGAN, ATARI, Generative)
mabirck/Geo_Classifier_siamese_NN
Generic geolocation classifier using Siamese neural networks, built with TensorFlow and Keras.
mabirck/joint_ppo_pytorch
Extension of https://github.com/ikostrikov/pytorch-a2c-ppo-acktr, making it feasible to train on multiple games simultaneously.
mabirck/and-nd-firebase
Course code repository for Firebase in a Weekend by Google: Android
mabirck/AndroidUdacity
Udacity course, implementations and code.
mabirck/Berkeley_CS188
Repository containing solutions for the Berkeley CS188 class.
mabirck/CS20SI_Tensorflow4DL_Research
Code and stuff from the Stanford course on TensorFlow
mabirck/CS294-DeepRL
My work for the CS294 Deep Reinforcement Learning course, taught by Sergey Levine at UC Berkeley.
mabirck/DDPG-Keras-Torcs
Using Keras and Deep Deterministic Policy Gradient to play TORCS
mabirck/Deep_RL_Bootcamp
Solutions for the labs in Deep RL Bootcamp.
mabirck/deeplearning_tutorials
Plenty of deep learning resources accompanied by notebooks, for learning purposes.
mabirck/Generic_Seq2Seq
A replication of the original Seq2Seq from the PyTorch tutorials, made easy to use and adapt.
mabirck/graphium
Let me try to pimp a city with amazing graffiti
mabirck/joint_tf_ppo
Extension of https://github.com/openai/baselines, making it feasible to train on multiple games simultaneously.
mabirck/Locally-Competitive-a2c
mabirck/modular_DeepRL
Attempt to implement the A2C and PPO algorithms with the modular properties of Maxout and LWTA. # UNFINISHED AND FAILED
mabirck/our-daily-paper
List of papers I have read or will read, just to keep track. (I should have done this before!!!)
mabirck/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
mabirck/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
mabirck/pytorch-ewc
PyTorch implementation of DeepMind's PNAS 2017 paper "Overcoming Catastrophic Forgetting"
mabirck/pytorch-lr-scheduler
Bring some LR schedulers from Keras to PyTorch.
mabirck/Video_GAN_Sonic
[UNDERDEVELOPED, CHECK THE LINK BELOW] This was an early attempt to generate a Sonic frame from past frames using GANs. I am opening this because there is plenty of useful infrastructure code covering the steps to make it happen, even though no convergence is achieved in this repo!
mabirck/rl_a3c_pytorch
Reinforcement learning with A3C LSTM on Atari, in PyTorch
mabirck/texufpel
LaTeX class for UFPel documents (specifically Computer Science documents)
mabirck/ud851-Exercises
mabirck/ud851-Sunshine