fabienpesquerel

PhD student in Reinforcement Learning.

INRIA SCOOLVilleneuve d'Ascq, France

Pinned Repositories

ComputationalStatistics_Class
A theoretical and numerical tour of computational statistics
0 1 00
cppn
A pytorch implementation of what is described by hardmaru: http://blog.otoro.net/2016/04/01/generating-large-images-from-latent-vectors/
Language:Python4 1 01
Cycle-GAN-for-euclidian-spaces
A cycle gan model to handle euclidian-like distribution
Language:Python0 1 00
fabienpesquerel.github.io
Language:HTML0 1 00
forban
A simple environment to test and benchmark algorithms tackling various bandit problems.
Language:Python2 2 00
IMED-RL
Source code for the paper "IMED-RL: Regret optimal learning of ergodic Markov decision processes" - NeurIPS 2022
Language:Python2 1 01
Logarithmic-regret-in-communicating-MDPs-Leveraging-known-dynamics-with-bandits
Code for NeurIPS 2023. We study regret minimization in an average-reward and communicating Markov Decision Process (MDP) with known dynamics, but unknown reward function.
Language:Python0 1 00
MCMC
Python implementation (from scratch) of some MCMC samplers that can leverage pyTorch's autodifferentiation (with examples).
Language:Python4 2 00
ReinforcementLearning_Class
A theoretical and numerical tour of Reinforcement Learning (and sequential decision making)
0 1 00
stochastic-bandits-with-groups-of-similar-arms-neurips-2021
This is the official repository of the paper Stocastic bandits with groups of similar arms (NeurIPS 2021). It contains the code that was used to compute the figures and experiments of the paper.
Language:Jupyter Notebook0 2 00

fabienpesquerel's Repositories

fabienpesquerel/cppn
A pytorch implementation of what is described by hardmaru: http://blog.otoro.net/2016/04/01/generating-large-images-from-latent-vectors/
Language:Python4 1 01
fabienpesquerel/MCMC
Python implementation (from scratch) of some MCMC samplers that can leverage pyTorch's autodifferentiation (with examples).
Language:Python4 2 00
fabienpesquerel/forban
A simple environment to test and benchmark algorithms tackling various bandit problems.
Language:Python2 2 00
fabienpesquerel/IMED-RL
Source code for the paper "IMED-RL: Regret optimal learning of ergodic Markov decision processes" - NeurIPS 2022
Language:Python2 1 01
fabienpesquerel/ComputationalStatistics_Class
A theoretical and numerical tour of computational statistics
0 1 00
fabienpesquerel/Cycle-GAN-for-euclidian-spaces
A cycle gan model to handle euclidian-like distribution
Language:Python0 1 00
fabienpesquerel/fabienpesquerel.github.io
Language:HTML0 1 00
fabienpesquerel/Logarithmic-regret-in-communicating-MDPs-Leveraging-known-dynamics-with-bandits
Code for NeurIPS 2023. We study regret minimization in an average-reward and communicating Markov Decision Process (MDP) with known dynamics, but unknown reward function.
Language:Python0 1 00
fabienpesquerel/ReinforcementLearning_Class
A theoretical and numerical tour of Reinforcement Learning (and sequential decision making)
0 1 00
fabienpesquerel/stochastic-bandits-with-groups-of-similar-arms-neurips-2021
This is the official repository of the paper Stocastic bandits with groups of similar arms (NeurIPS 2021). It contains the code that was used to compute the figures and experiments of the paper.
Language:Jupyter Notebook0 2 00
fabienpesquerel/tda_project
Language:Python0 3 01
fabienpesquerel/thesis
Repository for my PhD manuscript
1 0

fabienpesquerel

Pinned Repositories

ComputationalStatistics_Class

cppn

Cycle-GAN-for-euclidian-spaces

fabienpesquerel.github.io

forban

IMED-RL

Logarithmic-regret-in-communicating-MDPs-Leveraging-known-dynamics-with-bandits

MCMC

ReinforcementLearning_Class

stochastic-bandits-with-groups-of-similar-arms-neurips-2021

fabienpesquerel's Repositories

fabienpesquerel/cppn

fabienpesquerel/MCMC

fabienpesquerel/forban

fabienpesquerel/IMED-RL

fabienpesquerel/ComputationalStatistics_Class

fabienpesquerel/Cycle-GAN-for-euclidian-spaces

fabienpesquerel/fabienpesquerel.github.io

fabienpesquerel/Logarithmic-regret-in-communicating-MDPs-Leveraging-known-dynamics-with-bandits

fabienpesquerel/ReinforcementLearning_Class

fabienpesquerel/stochastic-bandits-with-groups-of-similar-arms-neurips-2021

fabienpesquerel/tda_project

fabienpesquerel/thesis