$ conda create --name rdp python=3.8.10
$ conda activate rdp
$ pip install -r requirements.txt
$ cd ./nonmarkov-envs
$ pip install .
$ cd ./nonmarkov-env/nonmarkov-env
$ python __main__.py --env ENV_NAME --only-mcts ONLY-MCTS --episode EPISODE --max-iterations MAX-ITERATION --min-samples MIN-SAMPLE --mixed MIXED
where
ENV_NAME
is the name of the domain; you can choose between CheatMAB, RotatingMAB, and RotatingMaze
ONLY-MCTS
whether to run only the MCTS algorithm; default is False
EPISODE
length of an episode of the environment; default is 5
MAX-ITERATION
maximum number of iterations of the S3M algorithm; default is 4500
MIN-SAMPLE
minimum number of samples for the S3M algorithm; default is 100
MIXED
whether to run the mixed S3M and MCTS algorithm and save the figure; default is False
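As a concrete illustration, the sketch below fills in the placeholders with the RotatingMAB domain and the default values listed above. The shell variable names are hypothetical, and it assumes the boolean options accept the literal values True and False on the command line, which the description above does not confirm. The command is only printed, so it can be checked before it is run.

```shell
# Hypothetical example values: RotatingMAB plus the documented defaults.
ENV_NAME=RotatingMAB
ONLY_MCTS=False      # assumption: booleans are passed as the literals True/False
EPISODE=5
MAX_ITERATIONS=4500
MIN_SAMPLES=100
MIXED=False          # same assumption as ONLY_MCTS

# Assemble the command as a string so it can be inspected before running.
CMD="python __main__.py --env $ENV_NAME --only-mcts $ONLY_MCTS --episode $EPISODE --max-iterations $MAX_ITERATIONS --min-samples $MIN_SAMPLES --mixed $MIXED"

# Dry run: print the assembled command instead of executing it.
echo "$CMD"
```

To launch the run, execute the printed command from the directory that contains __main__.py, with the rdp environment activated.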