bandit-algorithms

There are 86 repositories under bandit-algorithms topic.

SMPyBandits/SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
Language:Jupyter Notebook389 20 21058
c-bata/goptuna
A hyperparameter optimization framework, inspired by Optuna.
Language:Go261 10 4722
WilliamLwj/PyXAB
PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
Language:Python154 26 1842
KKeishiro/Yahoo_recommendation
Yahoo! news article recommendation system by linUCB
Language:Python107 4 044
gdmarmerola/interactive-intro-rl
Big Data's open seminars: An Interactive Introduction to Reinforcement Learning
Language:Jupyter Notebook63 2 020
sshkhr/Practical_RL
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Language:Jupyter Notebook55 5 125
Alanthink/banditpylib
A lightweight python library for bandit algorithms
Language:Python29 4 115
niffler92/Bandit
Bandit algorithms
Language:Python29 2 06
doerlbh/MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Language:Cuda27 3 05
kulinshah98/Multi-Armed-Bandit-Algorithms
Python implementation of UCB, EXP3 and Epsilon greedy algorithms
Language:Python27 2 210
gdmarmerola/advanced-bandit-problems
More about the exploration-exploitation tradeoff with harder bandits
Language:Jupyter Notebook23 2 013
mmalekzadeh/privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
Language:Jupyter Notebook22 1 07
ZIYU-DEEP/Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems
A curated list on papers about combinatorial multi-armed bandit problems.
17 2 10
rssalessio/reading-list
This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.
11 2 00
gokceuludogan/interactive-music-recommendation
Personalized and Interactive Music Recommendation with Bandit approach
Language:Jupyter Notebook10 5 03
sparsh-ai/reco-bandit
Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning
Language:Jupyter Notebook10 3 04
babaniyi/Deep-contextual-bandits
A benchmark to test decision-making algorithms for contextual-bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and syntethic data problems exhibiting a diverse set of properties.
Language:Python9 1 11
MaxenceGiraud/MachineLearningAlgos
Personal reimplementation of some ML algorithms for learning purposes
Language:Python9 3 52
Naereen/Kullback-Leibler-divergences-and-kl-UCB-indexes
🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes
Language:HTML9 5 37
ngutowski/algossim
This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.
Language:Python8 3 03
albertopirillo/ola-project-2023
Pricing and advertising strategy for the e-commerce of an airline company, based on Multi-Armed Bandits (MABs) algorithms and Gaussian Processes. Simulations include non-stationary environments.
Language:Python7 1 00
doerlbh/BanditZoo
Python library of bandits and RL agents in different real-world environments
Language:Python7 1 24
jayrcausal/Essential3CRL
Research about Causality-based Reinforcement Learning. This repository includes all needed fundamentals, summary of past work and some most recent development
Language:Jupyter Notebook7 2 00
ZiruiYan/awesome-causal-bandit
An list of papers for causal bandit
7 1 00
DURUII/Replica-AUCB
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
Language:Python6 1 00
GjjvdBurg/ThompsonSampling
Source code for blog post on Thompson Sampling
Language:JavaScript6 4 01
niravnb/Multi-armed-bandit-algortihms
Implementation of famous Bandits algortihm: Explore then commit, UCB & Thompson sampling in python.
Language:Jupyter Notebook6 1 13
amirhosein-mesbah/Reinforcement_learning
This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.
Language:Jupyter Notebook5 2 00
duongnhatthang/meta-bandit
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Language:Python5 3 00
guptav96/bandit-algorithms
A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB
Language:Python5 1 02
nicoleorzan/Multi-armed-bandit-RL
C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)
Language:C++5 2 02
amirbalef/PS_MOMAB
Multi-Objective Multi-Armed Bandit
Language:Python4 1 02
anishacharya/Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
Language:Jupyter Notebook4 3 00
jia-yi-chen/Bandit-and-Reinforcement-Learning
Python implementation for Reinforcement Learning algorithms -- Bandit algorithms, MDP, Dynamic Programming (value/policy iteration), Model-free Control (off-policy Monte Carlo, Q-learning)
Language:Python4 2 01
junjiedong/warfarin-bandit
Contextual Bandit algorithms for Warfarin Treatment
Language:Jupyter Notebook4 2 01
MIFA-Lab/LDPbandit2020
Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)
Language:Python4 1 01

bandit-algorithms

SMPyBandits/SMPyBandits

c-bata/goptuna

WilliamLwj/PyXAB

KKeishiro/Yahoo_recommendation

gdmarmerola/interactive-intro-rl

sshkhr/Practical_RL

Alanthink/banditpylib

niffler92/Bandit

doerlbh/MiniVox

kulinshah98/Multi-Armed-Bandit-Algorithms

gdmarmerola/advanced-bandit-problems

mmalekzadeh/privacy-preserving-bandits

ZIYU-DEEP/Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

rssalessio/reading-list

gokceuludogan/interactive-music-recommendation

sparsh-ai/reco-bandit

babaniyi/Deep-contextual-bandits

MaxenceGiraud/MachineLearningAlgos

Naereen/Kullback-Leibler-divergences-and-kl-UCB-indexes

ngutowski/algossim

albertopirillo/ola-project-2023

doerlbh/BanditZoo

jayrcausal/Essential3CRL

ZiruiYan/awesome-causal-bandit

DURUII/Replica-AUCB

GjjvdBurg/ThompsonSampling

niravnb/Multi-armed-bandit-algortihms

amirhosein-mesbah/Reinforcement_learning

duongnhatthang/meta-bandit

guptav96/bandit-algorithms

nicoleorzan/Multi-armed-bandit-RL

amirbalef/PS_MOMAB

anishacharya/Bandits-Online-Learning

jia-yi-chen/Bandit-and-Reinforcement-Learning

junjiedong/warfarin-bandit

MIFA-Lab/LDPbandit2020