bandit-algorithms
There are 86 repositories under bandit-algorithms topic.
SMPyBandits/SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
c-bata/goptuna
A hyperparameter optimization framework, inspired by Optuna.
WilliamLwj/PyXAB
PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
KKeishiro/Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
gdmarmerola/interactive-intro-rl
Big Data's open seminars: An Interactive Introduction to Reinforcement Learning
sshkhr/Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Alanthink/banditpylib
A lightweight python library for bandit algorithms
niffler92/Bandit
Bandit algorithms
doerlbh/MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
kulinshah98/Multi-Armed-Bandit-Algorithms
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
gdmarmerola/advanced-bandit-problems
More about the exploration-exploitation tradeoff with harder bandits
mmalekzadeh/privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
ZIYU-DEEP/Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
rssalessio/reading-list
This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.
gokceuludogan/interactive-music-recommendation
Personalized and Interactive Music Recommendation with Bandit approach
sparsh-ai/reco-bandit
Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning
babaniyi/Deep-contextual-bandits
A benchmark to test decision-making algorithms for contextual-bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and syntethic data problems exhibiting a diverse set of properties.
MaxenceGiraud/MachineLearningAlgos
Personal reimplementation of some ML algorithms for learning purposes
Naereen/Kullback-Leibler-divergences-and-kl-UCB-indexes
🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes
ngutowski/algossim
This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.
albertopirillo/ola-project-2023
Pricing and advertising strategy for the e-commerce of an airline company, based on Multi-Armed Bandits (MABs) algorithms and Gaussian Processes. Simulations include non-stationary environments.
doerlbh/BanditZoo
Python library of bandits and RL agents in different real-world environments
jayrcausal/Essential3CRL
Research about Causality-based Reinforcement Learning. This repository includes all needed fundamentals, summary of past work and some most recent development
ZiruiYan/awesome-causal-bandit
An list of papers for causal bandit
DURUII/Replica-AUCB
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
GjjvdBurg/ThompsonSampling
Source code for blog post on Thompson Sampling
niravnb/Multi-armed-bandit-algortihms
Implementation of famous Bandits algortihm: Explore then commit, UCB & Thompson sampling in python.
amirhosein-mesbah/Reinforcement_learning
This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.
duongnhatthang/meta-bandit
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
guptav96/bandit-algorithms
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
nicoleorzan/Multi-armed-bandit-RL
C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
amirbalef/PS_MOMAB
Multi-Objective Multi-Armed Bandit
anishacharya/Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
jia-yi-chen/Bandit-and-Reinforcement-Learning
Python implementation for Reinforcement Learning algorithms -- Bandit algorithms, MDP, Dynamic Programming (value/policy iteration), Model-free Control (off-policy Monte Carlo, Q-learning)
junjiedong/warfarin-bandit
Contextual Bandit algorithms for Warfarin Treatment
MIFA-Lab/LDPbandit2020
Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)