Goal: Zero migration of the decision model in the virtual scene to the real scene guarantees good adaptivity and stability.
- AMDDPG
- AMRL
- PPO
- TRPO
- SAC
- MAML
- DDPG
- RL^2
- EPG
- DQN
- DDQN
- python=3.9
- mlagents==0.29.0
- torch
- gym
- numpy==1.20.3
- torch==1.8.1+cu102 torchvision==0.9.1+cu102 torchaudio===0.8.1 -f https://download.pytorch.org/whl/torch_stable.html
- reinforcement learning
- target detection
- Semantic segmentation
1) python amddg_run.py
2) python amrl_run.py
- Deep Q-Network (DQN)
- Double DQN
- Proximal Policy Optimization (PPO)
- Deep Deterministic Policy Gradient (DDPG)
- Soft Actor-Critic (SAC)
- Intrinsic Curiosity Module (ICM)
- CleanRL is a learning library based on the Gym API. It is designed to cater to newer people in the field and provides very good reference implementations.
- Tianshou is a learning library that's geared towards very experienced users and is design to allow for ease in complex algorithm modifications.
- RLlib is a learning library that allows for distributed training and inferencing and supports an extraordinarily large number of features throughout the reinforcement learning space.
- Ray/Lilib Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for simplifying ML compute.
@article{xiao2022feature,
title={Feature semantic space-based sim2real decision model},
author={Xiao, Wenwen and Luo, Xiangfeng and Xie, Shaorong},
journal={Applied Intelligence},
pages={1--17},
year={2022},
publisher={Springer}
}