yfletberliac

Research @ Cohere | Postdoc @ Stanford | PhD @ Inria

InstadeepParis, France

Pinned Repositories

rlberry
An easy-to-use reinforcement learning library for research and education.
Language:Python161 9 13130
SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
Language:Jupyter Notebook391 20 21058
actor-with-variance-estimated-critic
AVEC: Actor with Variance Estimated Critic
Language:Python5 3 01
adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
Language:Python48 4 38
citygan
Simulating urban land use maps with generative adversarial networks
Language:Jupyter Notebook1 2 00
kaggle-shelter-animal-outcomes
[Project with KAIST] Data Mining Kaggle competition. Best score using xgboost.
Language:Python1 1 00
machine-translation-tensorflow
Machine Translation for specialized texts using LSTMs.
Language:Shell1 1 00
mango
Question-Answering NLP model with character-level RNN (TensorFlow).
Language:Python16 5 52
MERL
MERL: Multi-Head Reinforcement Learning (TensorFlow).
Language:Python5 4 00
rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
Language:Jupyter Notebook88 10 043

yfletberliac's Repositories

yfletberliac/rlss-2019
Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).
Language:Jupyter Notebook88 10 043
yfletberliac/adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
Language:Python48 4 38
yfletberliac/mango
Question-Answering NLP model with character-level RNN (TensorFlow).
Language:Python16 5 52
yfletberliac/actor-with-variance-estimated-critic
AVEC: Actor with Variance Estimated Critic
Language:Python5 3 01
yfletberliac/MERL
MERL: Multi-Head Reinforcement Learning (TensorFlow).
Language:Python5 4 00
yfletberliac/citygan
Simulating urban land use maps with generative adversarial networks
Language:Jupyter Notebook1 2 00
yfletberliac/kaggle-shelter-animal-outcomes
[Project with KAIST] Data Mining Kaggle competition. Best score using xgboost.
Language:Python1 1 00
yfletberliac/machine-translation-tensorflow
Machine Translation for specialized texts using LSTMs.
Language:Shell1 1 00
yfletberliac/rasa_nlu
turn natural language into structured data
Language:Python1 1 01
yfletberliac/saacjax
Language:Python1 3 00
yfletberliac/alpha-zero
Language:Jupyter Notebook1 0
yfletberliac/alpha-zero-connect4
Language:Python1 0
yfletberliac/coinrun
Language:C++1 0
yfletberliac/d3rlpy
An offline deep reinforcement learning library
Language:Python1 0
yfletberliac/DrQA
Reading Wikipedia to Answer Open-Domain Questions
Language:Python2 0
yfletberliac/entity-network
Tensorflow implementation of "Tracking the World State with Recurrent Entity Networks" [https://arxiv.org/abs/1612.03969] by Henaff, Weston, Szlam, Bordes, and LeCun.
Language:Python2 0
yfletberliac/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
Language:Python1 0
yfletberliac/minigo
Language:Python1 0
yfletberliac/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
1 0
yfletberliac/pyduckling
Language:Python2 0
yfletberliac/ray
Language:Python1 0
yfletberliac/recurrent-entity-networks
TensorFlow implementation of "Tracking the World State with Recurrent Entity Networks".
Language:Python2 0
yfletberliac/SAUNA
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL (TensorFlow).
Language:Python3 0
yfletberliac/server
The interface between data scientists and developers.
Language:Scala1 0
yfletberliac/Ssup
Language:Python4 0
yfletberliac/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python1 01
yfletberliac/transformrl
Language:Python2 0
yfletberliac/trtn.github.io
Language:CSS1 0
yfletberliac/ViZDoom
Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. :godmode:
Language:C++1 0
yfletberliac/vizdoomgym
OpenAI Gym wrapper for ViZDoom enviroments
Language:Python1 01