/dual-bandit-kdd-2020

Source code for our paper "Joint Policy-Value Learning for Recommendation", published at KDD 2020.

Primary language: Python. License: MIT.
