bandit-learning
There are 36 repositories under bandit-learning topic.
cair/TsetlinMachine
Code and datasets for the Tsetlin Machine
cair/pyTsetlinMachine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
SamRagusa/Checkers-Reinforcement-Learning
A checkers reinforcement learning AI, and all the tools needed to train it.
cair/convolutional-tsetlin-machine-tutorial
Tutorial on the Convolutional Tsetlin Machine
cair/pyTsetlinMachineParallel
Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.
thunfischtoast/LinUCB
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
mmalekzadeh/privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
etiennekintzler/visualize_bandit_algorithms
Some visualizations of bandit algorithm outputs.
Nth-iteration-labs/streamingbandit-ui
Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.
AntoineG92/Online-Clustering-of-Bandits-ENSAE
Based on Gentile-Li-Zapella article "Online Clustering of Bandits"
crenwick/Swiper
🦊 A series of bandit algorithms in Swift
juliakreutzer/bandit-neuralmonkey
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).
anishacharya/Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
thiagopbueno/pybayesbandit
Bayesian bandits in Python3.
juliakreutzer/bandit-cdec
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
0x65-e/Stats-115
Homework Code for UCLA STATS 115 (Probabilistic Decision Making) Fall 22 Offering
chan-yc/comp0089-reinforcement-learning
UCL COMP0089 Reinforcement Learning (2023/24)
florian/reinforcement-learning
Implementing RL algorithms
kapshaul/OnlineLearning
Repository of Online Learning algorithms, including Bandits, UCB, and more.
shashankp914/Over-the-wire-wargames-Solutions
Detailed solution of solving wargames of over the wire which includes bandit and in future many more.
victor-iyi/policy-gradient
A policy gradient approach to a multi-armed bandit problem
znreza/RL_Best_Presentation
This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.
dscolby/Whiteboard
A virtual whiteboard so I don't forget the ideas that come to me
fouratifares/RGL
Randomized Greedy Learning Under Full-bandit Feedback
jpthanga/10-Arm-Bandit
Implementation of 10 Arm Bandit using RLGlue
SFV-CORE/Bandit_OverTheWire
Aqui irei explicar como passar de cada nível do CTF Bandit fornecido pela Over The Wire
zeroinfiniti/bandit-wargames
Leveling up on the Bandit Wargames
ad0x99/linux-4-fun
My Linux Notes
jonad/smartcab
Train a SmartCab how to drive using reinforcement learning.
victor-iyi/contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
vitorhugo13/feup-mssi
Repository of code developed for the course MSSI @FEUP.