sfujim

McGill University, Montreal, Quebec, Canada

Pinned Repositories

ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Language: Python · 3.6k 147 109524
quicksand
Sentiment Analysis Project with Derek Ruths
Language: Python · 1 3 00
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language: Python · 603 7 15140
Horizon
A platform for Applied Reinforcement Learning (Applied RL)
Language: Python · 1 1 01
LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
Language: Python · 34 1 07
SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
Language: Python · 15 1 02
TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language: Python · 1.8k 19 41440
TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
Language: Python · 339 4 447
TD7
Author's PyTorch implementation of TD7 for online and offline RL
Language: Python · 121 4 512
