contextual-bandits

There are 56 repositories under contextual-bandits topic.

VowpalWabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
Language:C++8.4k 352 1.3k1.9k
tensorflow/agents
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Language:Python2.8k 80 658720
david-cortes/contextualbandits
Python implementations of contextual bandits algorithms
Language:Python720 23 56140
st-tech/zr-obp
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Language:Python624 88 4284
fidelity/mabwiser
[IJAIT 2021] MABWiser: Contextual Multi-Armed Bandits Library
Language:Python198 12 2437
alison-carrera/onn
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Language:Python173 5 844
alison-carrera/mabalgs
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
Language:Python126 4 526
Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Language:R79 7 2427
banditml/banditml
A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
Language:Python64 5 310
instadeepai/catx
🐈‍⬛ Contextual bandits library for continuous action trees with smoothing in JAX
Language:Python62 3 23
lil-lab/blocks
Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)
Language:Python40 5 214
pemami4911/sinkhorn-policy-gradient.pytorch
Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"
Language:Python38 6 210
Heewon-Hailey/multi-armed-bandits-for-recommendation-systems
implement basic and contextual MAB algorithms for recommendation system
Language:Jupyter Notebook31 3 18
thunfischtoast/LinUCB
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
Language:Java28 2 111
doerlbh/MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Language:Cuda25 3 05
mmalekzadeh/privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
Language:Jupyter Notebook22 1 07
improve-ai/python-ranker
Contextual Multi-Armed Bandit Platform for Scoring, Ranking & Decisions
Language:Python21 6 01
RonyAbecidan/Neural-Thompson-Sampling
Study of the paper 'Neural Thompson Sampling' published in October 2020
Language:Jupyter Notebook20 1 25
jtcho/FairMachineLearning
Implementation of provably Rawlsian fair ML algorithms for contextual bandits.
Language:Jupyter Notebook14 4 04
thoughtworks/simplebandit
lightweight contextual bandit library for ts/js
Language:TypeScript13 13 20
improve-ai/swift-ranker
Easily Score & Rank Codable Objects with ML
Language:Swift11 3 00
sparsh-ai/reco-bandit
Building recommender Systems using contextual bandit methods to address cold-start issue and online real-time learning
Language:Jupyter Notebook10 3 02
travisbrady/ocaml-vw
OCaml bindings to vowpal wabbit
Language:OCaml10 3 00
marlesson/meta-bandit-selector
The Contextual Meta-Bandit (CMB) can be used to select models using the context with online learning based on Reiforcement Learning problem. It's can be used for recommender system ensemble, A/B test, and other dynamic model selector problem.
Language:Jupyter Notebook9 3 13
hsm207/cb-trading
Code to trade the financial markets using Contextual Bandits
Language:Jupyter Notebook8 3 02
Nth-iteration-labs/streamingbandit-ui
Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.
Language:JavaScript8 5 143
improve-ai/tracker-trainer
Contextual Multi-Armed Bandit Reward Tracker & Model Trainer
Language:Python7 5 13
zaid-g/ccb_tutorial
Contextual multi-armed bandit recommender system using Vowpal Wabbit
Language:Python7 2 10
aaronkurz/hitl-ab-bpm
Business Process Improvement with Reinforcement Learning and Human-in-the-Loop.
Language:Python6 4 852
doerlbh/dilemmaRL
Code for our PRICAI 2022 paper: "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior".
Language:Python6 2 00
aldente0630/multi_armed_bandit
Experiment results using MAB algorithms in Yahoo! Front Page Today Module User Click Log dataset
Language:Jupyter Notebook5 1 13
jackgerrits/reductionml
Reduction-based machine learning framework with a focus on contextual bandits
Language:Rust5 3 52
Murtazali05/LinUCB
LinUCB with disjoint linear models
Language:Python5 2 00
ngutowski/algossim
This repository aims at learning most popular MAB and CMAB algorithms and watch how they run. It is interesting for those wishing to start learning these topics.
Language:Python5 3 03
saeedghoorchian/NCC-Bandits
Experiments for paper "Online Learning with Costly Features in Non-stationary Environments"
Language:Jupyter Notebook5 1 00
TheAmazingElys/NeuralBandit
Code of the NeuralBandit paper
Language:Python5 3 12

contextual-bandits

VowpalWabbit/vowpal_wabbit

tensorflow/agents

david-cortes/contextualbandits

st-tech/zr-obp

fidelity/mabwiser

alison-carrera/onn

alison-carrera/mabalgs

Nth-iteration-labs/contextual

banditml/banditml

instadeepai/catx

lil-lab/blocks

pemami4911/sinkhorn-policy-gradient.pytorch

Heewon-Hailey/multi-armed-bandits-for-recommendation-systems

thunfischtoast/LinUCB

doerlbh/MiniVox

mmalekzadeh/privacy-preserving-bandits

improve-ai/python-ranker

RonyAbecidan/Neural-Thompson-Sampling

jtcho/FairMachineLearning

thoughtworks/simplebandit

improve-ai/swift-ranker

sparsh-ai/reco-bandit

travisbrady/ocaml-vw

marlesson/meta-bandit-selector

hsm207/cb-trading

Nth-iteration-labs/streamingbandit-ui

improve-ai/tracker-trainer

zaid-g/ccb_tutorial

aaronkurz/hitl-ab-bpm

doerlbh/dilemmaRL

aldente0630/multi_armed_bandit

jackgerrits/reductionml

Murtazali05/LinUCB

ngutowski/algossim

saeedghoorchian/NCC-Bandits

TheAmazingElys/NeuralBandit