Pinned Repositories
Advanced-Lane-Finding
Udacity Self-driving Car Nanodegree Project
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
awesome-project-ideas
Curated list of Machine Learning, NLP, Vision, Reinforcement Learning Project Ideas
awesome-semantic-segmentation
🤘 awesome-semantic-segmentation
BCQ
Author's PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
cnn-facial-landmark
Facial landmark detection based on deep convolutional neural network.
Complete-Python-3-Bootcamp
Course Files for Complete Python 3 Bootcamp Course on Udemy
continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, online EWC, SI, LwF, GR, GR+distill, RtF, ER, A-GEM, iCaRL).
Courses-
Quizzes and assignments from Coursera
cse474-ub-notebooks
Collection of IPython notebooks for UB CSE Machine Learning course (CSE 474/574)
lalapo's Repositories
lalapo/BCQ
Author's PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
lalapo/continual-learning
PyTorch implementation of various methods for continual learning (XdG, EWC, online EWC, SI, LwF, GR, GR+distill, RtF, ER, A-GEM, iCaRL).
lalapo/Courses-
Quizzes and assignments from Coursera
lalapo/cse474-ub-notebooks
Collection of IPython notebooks for UB CSE Machine Learning course (CSE 474/574)
lalapo/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
lalapo/Deep-Reinforcement-Learning-Hands-On-Second-Edition
Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt
lalapo/Dp-without-Adv
Guided dialogue policy learning without adversarial learning in the loop
lalapo/fastbook
The fastai book, published as Jupyter Notebooks
lalapo/GDPL
Task-oriented Dialog Policy Learning with Adversarial Inverse Reinforcement Learning
lalapo/GIRIL
ICML'20: Intrinsic Reward Driven Imitation Learning via Generative Model
lalapo/iTAML
Official implementation of "iTAML: An Incremental Task-Agnostic Meta-learning Approach". CVPR 2020
lalapo/lalapo.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
lalapo/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
lalapo/MADPL
Task-oriented Dialog Policy Learning with Multi-Agent Reinforcement Learning
lalapo/MER
Fork of the GEM project (https://github.com/facebookresearch/GradientEpisodicMemory) including Meta-Experience Replay (MER) methods from the ICLR 2019 paper (https://openreview.net/pdf?id=B1gTShAct7)
lalapo/metalight
MetaLight: a value-based meta-reinforcement learning framework for traffic signal control
lalapo/MPLight
lalapo/multi_timescale_replay
Code for Continual Reinforcement Learning with Multi-Timescale Replay
lalapo/neural_chat
Code to support training, evaluating, and interacting with neural network dialog models, and training them with reinforcement learning. Code to deploy a web server that hosts the models live online is available at: https://github.com/asmadotgh/neural_chat_web
lalapo/NeuralDialog-LaRL
PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
lalapo/Practical_RL
A course in reinforcement learning in the wild
lalapo/proj-lfi-ml
AMoD 2019 - ML-based lane following
lalapo/pytorch-Deep-Learning
Deep Learning (with PyTorch)
lalapo/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
lalapo/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
lalapo/RL-Indirect-imitation
Code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
lalapo/ryu
Ryu component-based software defined networking framework
lalapo/soft-Q-learning
Discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL) implementation in PyTorch, simple!
lalapo/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
lalapo/TTOS
Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial Training