olivierjeunen/dual-bandit-kdd-2020
Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020.
Python · MIT
Stargazers
- AppServiceProvider · Dhaka, Bangladesh
- charleshuangruo
- dsflu
- esafak · Archipelago AI
- fanlinbo · Amazing Seasun
- fly51fly · PRIS
- geopanag · Amazon
- huiwang98 · Soochow University
- JusticeTorpedo
- kiminh
- maosengshulei
- mindis · Marks and Spencer
- mquad · Politecnico di Milano
- nimitpattanasri · Upwork
- pigooosuke · Tokyo/JP
- pm3310 · King (Microsoft)
- rjagerman · Google
- russellkim · HKUST
- shashankg7 · UvA
- sumitsidana · Wolt
- sungjinl
- travisbrady