It contains a Matlab implementation of the BanditQ policy described in our UAI 2024 paper (https://www.tifr.res.in/~abhishek.sinha/files/BanditQ_UAI24.pdf).
Instruction to run the code: Please download the entire code folder. To experiment with the full information setting, please run the script "BanditQ_full_info.m". To experiment with the bandit information setting, please run the script "BanditQ_bandit_expt.m". The necessary parameters can be changed at the beginning of the script. The plots will be saved as pdf files in the same folder.