Pinned Repositories
ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
quicksand
Sentiment Analysis Project with Derek Ruths
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Horizon
A platform for Applied Reinforcement Learning (Applied RL)
LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
TD7
Author's PyTorch implementation of TD7 for online and offline RL
sfujim's Repositories
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
sfujim/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
sfujim/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
sfujim/TD7
Author's PyTorch implementation of TD7 for online and offline RL
sfujim/LAP-PAL
Author's PyTorch implementation of LAP and PAL with TD3 and DDQN
sfujim/SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
sfujim/Horizon
A platform for Applied Reinforcement Learning (Applied RL)