mabirck

CS Undergrad at UFPel (Federal University of Pelotas). Deep and Reinforcement Learning Undergrad Researcher at @ufpeldatalab.

@ufpeldatalab Pelotas-RS, Brazil

Pinned Repositories

adaptative-dropout-pytorch
PyTorch implementation of Adaptive Dropout, a.k.a. Standout.
Language:Python11 2 10
and-nd-firebase
Course code repository for Firebase in a Weekend by Google: Android
Language:Java1 2 00
AndroidUdacity
Udacity course, implementations and code.
1 2 00
Atari-WGAN
WGAN implementation for generating Atari game images. (GAN, WGAN, ATARI, Generative)
Language:Python2 2 00
AttentionTRL
Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind).
Language:Python8 5 01
Berkeley_CS188
Repository containing solutions for the Berkeley CS188 class.
Language:Python1 2 00
CatastrophicForgetting-EWC
WORK IN PROGRESS: PyTorch implementation of supervised and Deep Q-Learning EWC (Elastic Weight Consolidation), introduced in "Overcoming Catastrophic Forgetting in Neural Networks".
Language:Python25 2 14
Geo_Classifier_siamese_NN
Generic geolocation classifier built on siamese neural networks with TensorFlow and Keras.
Language:Python2 4 01
joint_ppo_pytorch
Extension of https://github.com/ikostrikov/pytorch-a2c-ppo-acktr, making it feasible to train on multiple games simultaneously.
Language:Python2 2 00
Maml_Reptile_PyTorch
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" in PyTorch
Language:Python8 3 01

mabirck's Repositories

mabirck/CatastrophicForgetting-EWC
WORK IN PROGRESS: PyTorch implementation of supervised and Deep Q-Learning EWC (Elastic Weight Consolidation), introduced in "Overcoming Catastrophic Forgetting in Neural Networks".
Language:Python25 2 14
mabirck/adaptative-dropout-pytorch
PyTorch implementation of Adaptive Dropout, a.k.a. Standout.
Language:Python11 2 10
mabirck/AttentionTRL
Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind).
Language:Python8 5 01
mabirck/Maml_Reptile_PyTorch
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks" in PyTorch
Language:Python8 3 01
mabirck/Atari-WGAN
WGAN implementation for generating Atari game images. (GAN, WGAN, ATARI, Generative)
Language:Python2 2 00
mabirck/Geo_Classifier_siamese_NN
Generic geolocation classifier built on siamese neural networks with TensorFlow and Keras.
Language:Python2 4 01
mabirck/joint_ppo_pytorch
Extension of https://github.com/ikostrikov/pytorch-a2c-ppo-acktr, making it feasible to train on multiple games simultaneously.
Language:Python2 2 00
mabirck/and-nd-firebase
Course code repository for Firebase in a Weekend by Google: Android
Language:Java1 2 00
mabirck/AndroidUdacity
Udacity course, implementations and code.
1 2 00
mabirck/Berkeley_CS188
Repository containing solutions for the Berkeley CS188 class.
Language:Python1 2 00
mabirck/CS20SI_Tensorflow4DL_Research
Code and other material from the Stanford course on TensorFlow
1 2 0
mabirck/CS294-DeepRL
My material for the CS294 Deep Reinforcement Learning course, taught by Sergey Levine at UC Berkeley.
Language:Python1 2 0
mabirck/DDPG-Keras-Torcs
Using Keras and Deep Deterministic Policy Gradient to play TORCS
Language:Python1 2 0
mabirck/Deep_RL_Bootcamp
Solutions for the labs in Deep RL Bootcamp.
Language:Jupyter Notebook1 2 0
mabirck/deeplearning_tutorials
Plenty of Deep Learning resources with companion notebooks, for learning purposes.
Language:Jupyter Notebook1 2 0
mabirck/Generic_Seq2Seq
A replication of the original Seq2Seq from the PyTorch tutorials, made easy to use and adapt.
Language:Python1 2 0
mabirck/graphium
Let me try to pimp a city with amazing graffiti
Language:JavaScript1 2 0
mabirck/joint_tf_ppo
Extension of https://github.com/openai/baselines, making it feasible to train on multiple games simultaneously.
Language:Python1 2 0
mabirck/Locally-Competitive-a2c
Language:Python1 2 0
mabirck/modular_DeepRL
Attempt to implement the A2C and PPO algorithms with the modular properties of Maxout and LWTA. UNFINISHED AND FAILED
Language:Python1 2 0
mabirck/our-daily-paper
Paper List I have read or will read, just to keep control. (I should have done this before!!!)
1 3 01
mabirck/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
Language:Python1 3 0
mabirck/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Language:Python1 3 0
mabirck/pytorch-ewc
PyTorch implementation of DeepMind's PNAS 2017 paper "Overcoming Catastrophic Forgetting"
Language:Python1 3 0
mabirck/pytorch-lr-scheduler
Bring some LR schedulers from Keras to PyTorch.
Language:Python1 2 0
mabirck/Video_GAN_Sonic
[UNDERDEVELOPED, CHECK THE LINK BELOW] This was an early attempt to generate a Sonic frame from past frames using GANs. I am opening this because it contains plenty of useful infrastructure code for the steps involved, although no convergence is achieved in this repo!
Language:Python1 2 00
mabirck/rl_a3c_pytorch
Reinforcement learning A3C LSTM Atari with PyTorch
Language:Python2 0
mabirck/texufpel
LaTeX class for UFPel documents (specifically Computing documents)
Language:TeX2 0
mabirck/ud851-Exercises
Language:Java2 0
mabirck/ud851-Sunshine
Language:Java2 0