Pinned Repositories
Deep-Reinforcement-Learning-with-pytorch
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,DDPG for discrete action space, A2C, A3C, TD3, SAC, TRPO
mbpo_pytorch_offline
MBPO (paper: When to trust your model: Model-based policy optimization) in offline RL settings
PECAN
pecan_human_AI_coordination
Human-AI coordination experiments on Overcooked
TAPE
beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
LxzGordon's Repositories
LxzGordon/Deep-Reinforcement-Learning-with-pytorch
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,DDPG for discrete action space, A2C, A3C, TD3, SAC, TRPO
LxzGordon/PECAN
LxzGordon/pecan_human_AI_coordination
Human-AI coordination experiments on Overcooked
LxzGordon/TAPE
LxzGordon/mbpo_pytorch_offline
MBPO (paper: When to trust your model: Model-based policy optimization) in offline RL settings