mab
There are 24 repositories under mab topic.
alison-carrera/onn
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
LibreCat/Catmandu
Catmandu - a data processing toolkit
alison-carrera/mabalgs
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
Nth-iteration-labs/streamingbandit
Python application to setup and run streaming (contextual) bandit experiments.
MatteoGuadrini/vmam
VLAN Mac-address Authentication Manager
v-i-s-h/MAB.jl
A Julia Package for providing Multi Armed Bandit Experiments
jacksonpradolima/coleman4hcs
COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) - and strategies for HCS context
pko89403/Recommender
Implementation of recommender ( Pytorch & Keras )
duchuyle108/SDN-EgressNode-Selection
The work in paper "A Reinforcement Learning-Based Solution for Intra-Domain Egress Selection" - Duc-Huy LE, Hai Anh TRAN
DURUII/Replica-AUCB
🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
juliennonin/multiplayer-bandits
Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]
aldente0630/multi_armed_bandit
Experiment results using MAB algorithms in Yahoo! Front Page Today Module User Click Log dataset
abhinavcreed13/Multi-armed-bandits-MAB
This project implements famous MAB algorithms and evaluates them on the basis of their performance - EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson.
vmarchaud/ts-mab
Typescript implementation of a multi-armed bandit
DURUII/Replica-EUWR
🐯REPLICA of "Combinatorial Multi-Armed Bandit Based Unknown Worker Recruitment in Heterogeneous Crowdsensing"
pm3310/mab-covid19
Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively
tuhinsharma121/pybandit-archive
A Python library for all popular multi-armed bandit algorithms.
aijunbai/bandit
Algorithms for multi-armed bandit (MAB) problems
jiseongHAN/reinforcement
My Little Reinforcement Learning
avorozhtsov/shipit
Exploitation vs Exploration problem stated as A/B-testing with maximum profit per unit time.
Bachfischer/COMP90051-StatML-Assignment-2
Source code for Assignment 2 of COMP90051 (Semester 2 2020)
VladMarianCimpeanu/OLA_project
Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)
JoelJa835/MAB_Algorithms
Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.
sshaplygin/abcs
Adaptive bandit cache selection