exploration-exploitation

There are 43 repositories under the exploration-exploitation topic.

wzhe06/Reco-papers
Classic papers and resources on recommendation
Language: Python | 3.3k 194 3806
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language: Python | 3.2k 23 215381
tigerneil/awesome-deep-rl
For deep RL and the future of AI.
Language: HTML | 1.4k 108 3217
imsheridan/DeepRec
A collection of classic and cutting-edge industry papers and resources on recommendation and advertising / Must-read Papers on Recommendation System and CTR Prediction
1k 48 2219
david-cortes/contextualbandits
Python implementations of contextual bandits algorithms
Language: Python | 755 23 63147
YaoYao1995/MEEE
Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).
Language: Python | 470 62 276
opendilab/awesome-exploration-rl
A curated list of awesome exploration RL resources (continually updated)
426 7 012
TianhongDai/self-imitation-learning-pytorch
This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.
Language: Python | 66 2 013
holarissun/RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
Language: Python | 28 3 03
stratisMarkou/sample-efficient-bayesian-rl
Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL
Language: Jupyter Notebook | 22 3 015
gokceuludogan/interactive-music-recommendation
Personalized and Interactive Music Recommendation with Bandit approach
Language: Jupyter Notebook | 10 5 03
Amshra267/Thompson-Greedy-Comparison-for-MultiArmed-Bandits
Repository Containing Comparison of two methods for dealing with Exploration-Exploitation dilemma for MultiArmed Bandits
Language: Python | 9 0 00
kkm24132/ReinforcementLearning
Focuses on Reinforcement Learning related concepts, use cases, and learning approaches
Language: Jupyter Notebook | 8 2 03
kakaobrain/leco
Official implementation of LECO (NeurIPS'22)
Language: Python | 7 4 30
mbhenaff/neural-e3
Language: Python | 7 3 01
baturaysaglam/DISCOVER
Deep Intrinsically Motivated Exploration in Continuous Control
Language: Python | 5 1 10
guptav96/bandit-algorithms
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
Language: Python | 5 1 02
hmishfaq/LMC-LSVI
The official code release for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo, ICLR 2024.
Language: Python | 5 2 02
haoyangzheng1996/ts_ulmc
The GitHub repository for "Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo", AISTATS 2024.
Language: Python | 4 1 00
fabprezja/Deep-Learning-TPBook-Points
Some Key Points from the Deep Learning Tuning Playbook
3 1 01
hridayns/Research-Project-on-Reinforcement-learning
Research Thesis - Reinforcement Learning
Language: Python | 3 2 01
kochlisGit/Reinforcement-Learning-Algorithms
This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.
Language: Python | 3 1 00
Sagarnandeshwar/Bandit_Algorithms
Reinforcement Learning (COMP 579) Project
Language: Jupyter Notebook | 3 2 00
SXV357/Inspirit-AI-Deep-Dive-Designing-DL-Systems-FinalProject-RL-for-Autonomous-Vehicles
This project uses Reinforcement Learning to teach an agent to drive by itself and learn from its observations so that it can maximize the reward (180+ lines)
Language: Jupyter Notebook | 3 2 00
panxulab/LSVI-ASE
The official code release for "More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling", Reinforcement Learning Conference (RLC) 2024
Language: Python | 2 1 01
Ralami1859/Action-Elimination-for-Multi-Armed-Bandits
Action elimination for multi-armed bandits
Language: MATLAB | 2 1 01
spoluan/reinforcement_learning
This repository contains a variety of projects related to reinforcement learning, showcasing different approaches to implementing it in various scenarios.
Language: Jupyter Notebook | 2 2 00
tyoon10/Exploration-and-Exploitation
Language: Jupyter Notebook | 2 1 01
alxndrTL/RL-essais-cliniques
1 1 0
baturaysaglam/Q-Error-Exploration
An Optimistic Approach to the Q-Network Error in Actor-Critic Methods
Language: Python | 1 1 00
kalexandriabond/competing-representations-shape-evidence-accumulation
Human and sim. behavioral / small-scale neural data for paper: https://www.biorxiv.org/content/10.1101/2022.10.03.510668v2
Language: Jupyter Notebook | 1 1 00
rom1mouret/exploration
over-parameterization = exploration ?
Language: Python | 1 1 00
ruqoyyasadiq/deep_RL-multi-arm-bandit-exploration
This is an implementation of the Reinforcement Learning multi-arm-bandit experiment using different exploration techniques.
Language: Python | 1 2 00
zwkcoding/explore_map_standalone
Maintain an environmental exploration map & Update by Bayesian probability **For Autonomous Vehicle**
Language: C++ | 1 2 15
avorozhtsov/shipit
Exploitation vs Exploration problem stated as A/B-testing with maximum profit per unit time.
Language: Mathematica | 0 2 00
KaranAnchan/10_Arm_Testbed
Explore the 10-Arm Testbed Simulation! 🎲 Utilize Python to test various ε-greedy strategies in a reinforcement learning environment. Visualize and compare agents' performance as they balance exploration and exploitation. Perfect for learners and enthusiasts! 🚀📊
Language: Python | 0 1 00
