Reinforcement learning resources for recommender systems.

  • A list of papers, code, datasets, and other resources related to reinforcement learning and recommender systems (some entries include links to the PDF, code, and dataset).


[P1] Session-aware Item-combination Recommendation with Transformer Network [PDF]

[P2] RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems [PDF]

[P3] A Contextual-Bandit Approach to Personalized News Article Recommendation [PDF]

[P4] Partially Observable Reinforcement Learning for Dialog-based Interactive Recommendation [PDF]

[P5] Reinforcement Learning over Sentiment-Augmented Knowledge Graphs towards Accurate and Explainable Recommendation WSDM'22 [PDF]

[P6] Improving Daily Deals Recommendation Using Explore-Then-Exploit Strategies [PDF]

[P7] Scalable explore-exploit collaborative filtering. [PDF]

[P8] Factorization Bandits for Interactive Recommendation. [PDF]

[P9] Bandits and Recommender Systems. [PDF]

[P10] Adaptive, personalized diversity for visual discovery. [PDF]

[P11] Online clustering of bandits. [PDF]

[P12] Learning diverse rankings with multi-armed bandits. [PDF]

[P13] A Fast Bandit Algorithm for Recommendations to Users with Heterogeneous Tastes [PDF]

[P14] Contextual combinatorial bandit and its application on diversified online recommendation. [PDF]

[P15] A Multiple-Play Bandit Algorithm Applied to Recommender Systems. [PDF]

[P16] Top-k off-policy correction for a REINFORCE recommender system. [PDF] [YouTube video link]

[P17] Unified conversational recommendation policy learning via graph-based reinforcement learning [PDF]

[P18] When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution [PDF]

[P19] Cluster-Based Bandits: Fast Cold-Start for Recommender System New Users [PDF]

[P20] Comparison-based Conversational Recommender System with Relative Bandit Feedback [PDF]

[P21] A Survey of Deep Reinforcement Learning in Recommender Systems: A Systematic Review and Future Directions [PDF]

[P22] Reinforcement learning based recommender systems: A survey [PDF]

[P23] Online Decision Transformer [PDF]

[P24] Rethinking Reinforcement Learning for Recommendation: A Prompt Perspective [PDF]

[P25] Self-Supervised Reinforcement Learning for Recommender Systems [PDF]

ContentWise impressions: an industrial dataset with impressions included [dataset repo]

Goodreads: book metadata, user-book interactions (users' public shelves), and users' detailed book reviews. [dataset repo]

Goodreads spoilers [link]

Amazon Product Reviews (2018) [dataset repo]

Pinterest Fashion Compatibility [dataset repo]

Clothing Fit Data [Modcloth dataset]

Product Exchange/Bartering Data [dataset repo]

Simulation environments

RL for online advertising


[L2] Deep Learning on Graphs [PDF]

