Pinned Repositories
waymax_simulator
planTF
[ICRA'2024] Rethinking Imitation-based Planner for Autonomous Driving
pvp
Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlight)
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
decision-tree
基于kaggle上Titanic数据集实现的ID3、C4.5、CART和CART剪枝算法
offlineRL-INTERACTION
RLexample
Some basic examples of playing with RL
TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
ygo-agent
Yu-Gi-Oh is all you need
weiaiF's Repositories
weiaiF/offlineRL-INTERACTION
weiaiF/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
weiaiF/CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
weiaiF/decision-tree
基于kaggle上Titanic数据集实现的ID3、C4.5、CART和CART剪枝算法
weiaiF/RLexample
Some basic examples of playing with RL
weiaiF/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
weiaiF/ygo-agent
Yu-Gi-Oh is all you need