Pinned Repositories
rlberry
An easy-to-use reinforcement learning library for research and education.
SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
actor-with-variance-estimated-critic
AVEC: Actor with Variance Estimated Critic
adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
citygan
Simulating urban land use maps with generative adversarial networks
kaggle-shelter-animal-outcomes
[Project with KAIST] Data Mining Kaggle competition. Best score using xgboost.
machine-translation-tensorflow
Machine Translation for specialized texts using LSTMs.
mango
Question-Answering NLP model with character-level RNN (TensorFlow).
MERL
MERL: Multi-Head Reinforcement Learning (TensorFlow).
rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
yfletberliac's Repositories
yfletberliac/rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
yfletberliac/adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
yfletberliac/mango
Question-Answering NLP model with character-level RNN (TensorFlow).
yfletberliac/actor-with-variance-estimated-critic
AVEC: Actor with Variance Estimated Critic
yfletberliac/MERL
MERL: Multi-Head Reinforcement Learning (TensorFlow).
yfletberliac/citygan
Simulating urban land use maps with generative adversarial networks
yfletberliac/kaggle-shelter-animal-outcomes
[Project with KAIST] Data Mining Kaggle competition. Best score using xgboost.
yfletberliac/machine-translation-tensorflow
Machine Translation for specialized texts using LSTMs.
yfletberliac/rasa_nlu
turn natural language into structured data
yfletberliac/saacjax
yfletberliac/alpha-zero
yfletberliac/alpha-zero-connect4
yfletberliac/coinrun
yfletberliac/d3rlpy
An offline deep reinforcement learning library
yfletberliac/DrQA
Reading Wikipedia to Answer Open-Domain Questions
yfletberliac/entity-network
Tensorflow implementation of "Tracking the World State with Recurrent Entity Networks" [https://arxiv.org/abs/1612.03969] by Henaff, Weston, Szlam, Bordes, and LeCun.
yfletberliac/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
yfletberliac/minigo
yfletberliac/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
yfletberliac/pyduckling
yfletberliac/ray
yfletberliac/recurrent-entity-networks
TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".
yfletberliac/SAUNA
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL (TensorFlow).
yfletberliac/server
The interface between data scientists and developers.
yfletberliac/Ssup
yfletberliac/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
yfletberliac/transformrl
yfletberliac/trtn.github.io
yfletberliac/ViZDoom
Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. :godmode:
yfletberliac/vizdoomgym
OpenAI Gym wrapper for ViZDoom enviroments