bandit-learning

There are 36 repositories under bandit-learning topic.

cair/TsetlinMachine
Code and datasets for the Tsetlin Machine
Language:Cython488 49 1455
cair/pyTsetlinMachine
Implements the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, Weighted Tsetlin Machine, and Embedding Tsetlin Machine, with support for continuous features, multigranularity, clause indexing, and literal budget
Language:C145 11 1031
Nth-iteration-labs/contextual
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Language:R81 6 2526
SamRagusa/Checkers-Reinforcement-Learning
A checkers reinforcement learning AI, and all the tools needed to train it.
Language:Python58 5 113
cair/convolutional-tsetlin-machine-tutorial
Tutorial on the Convolutional Tsetlin Machine
Language:Python53 11 013
cair/pyTsetlinMachineParallel
Multi-threaded implementation of the Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetlin Machine, with support for continuous features and multigranularity.
Language:C43 8 19
thunfischtoast/LinUCB
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
Language:Java32 1 111
mmalekzadeh/privacy-preserving-bandits
Privacy-Preserving Bandits (MLSys'20)
Language:Jupyter Notebook22 1 07
etiennekintzler/visualize_bandit_algorithms
Some visualizations of bandit algorithm outputs.
Language:Jupyter Notebook9 1 05
Nth-iteration-labs/streamingbandit-ui
Client that handles the administration of StreamingBandit online, or straight from your desktop. Setup and run streaming (contextual) bandit experiments in your browser.
Language:JavaScript8 4 144
AntoineG92/Online-Clustering-of-Bandits-ENSAE
Based on Gentile-Li-Zapella article "Online Clustering of Bandits"
Language:Jupyter Notebook4 0 01
crenwick/Swiper
🦊 A series of bandit algorithms in Swift
Language:Swift4 0 00
juliakreutzer/bandit-neuralmonkey
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).
Language:Python4 1 04
anishacharya/Bandits-Online-Learning
Simple Implementations of Bandit Algorithms in python
Language:Jupyter Notebook3 2 00
thiagopbueno/pybayesbandit
Bayesian bandits in Python3.
Language:Python3 0 01
juliakreutzer/bandit-cdec
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
Language:C++2 2 0
rasros/combo
Language:Kotlin2 1 01
0x65-e/Stats-115
Homework Code for UCLA STATS 115 (Probabilistic Decision Making) Fall 22 Offering
Language:Python1 1 00
chan-yc/comp0089-reinforcement-learning
UCL COMP0089 Reinforcement Learning (2023/24)
Language:Jupyter Notebook1 1 00
florian/reinforcement-learning
Implementing RL algorithms
Language:Jupyter Notebook1 5 01
kapshaul/OnlineLearning
Repository of Online Learning algorithms, including Bandits, UCB, and more.
Language:Python1 1 00
shashankp914/Over-the-wire-wargames-Solutions
Detailed solution of solving wargames of over the wire which includes bandit and in future many more.
1 2 00
victor-iyi/policy-gradient
A policy gradient approach to a multi-armed bandit problem
Language:Jupyter Notebook1 1 0
vwang0/causal_inference
Language:Jupyter Notebook1 0 00
znreza/RL_Best_Presentation
This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.
1 0 00
dscolby/Whiteboard
A virtual whiteboard so I don't forget the ideas that come to me
0 1 00
fouratifares/RGL
Randomized Greedy Learning Under Full-bandit Feedback
Language:Python0 1 00
jpthanga/10-Arm-Bandit
Implementation of 10 Arm Bandit using RLGlue
Language:C0 1 00
SFV-CORE/Bandit_OverTheWire
Aqui irei explicar como passar de cada nível do CTF Bandit fornecido pela Over The Wire
0 1 00
zeroinfiniti/bandit-wargames
Leveling up on the Bandit Wargames
0 1 00
ad0x99/linux-4-fun
My Linux Notes
1 0
DenzilFrancisCrasta/bandit
Language:Python2 0
hartikainen/information-theoretic-bandit
Language:Python1 0
jonad/smartcab
Train a SmartCab how to drive using reinforcement learning.
Language:Jupyter Notebook2 0
victor-iyi/contextual-bandit
A Reinforcement Learning approach to a contextual bandit problem.
Language:Jupyter Notebook2 0
vitorhugo13/feup-mssi
Repository of code developed for the course MSSI @FEUP.
Language:Python3 0

bandit-learning

cair/TsetlinMachine

cair/pyTsetlinMachine

Nth-iteration-labs/contextual

SamRagusa/Checkers-Reinforcement-Learning

cair/convolutional-tsetlin-machine-tutorial

cair/pyTsetlinMachineParallel

thunfischtoast/LinUCB

mmalekzadeh/privacy-preserving-bandits

etiennekintzler/visualize_bandit_algorithms

Nth-iteration-labs/streamingbandit-ui

AntoineG92/Online-Clustering-of-Bandits-ENSAE

crenwick/Swiper

juliakreutzer/bandit-neuralmonkey

anishacharya/Bandits-Online-Learning

thiagopbueno/pybayesbandit

juliakreutzer/bandit-cdec

rasros/combo

0x65-e/Stats-115

chan-yc/comp0089-reinforcement-learning

florian/reinforcement-learning

kapshaul/OnlineLearning

shashankp914/Over-the-wire-wargames-Solutions

victor-iyi/policy-gradient

vwang0/causal_inference

znreza/RL_Best_Presentation

dscolby/Whiteboard

fouratifares/RGL

jpthanga/10-Arm-Bandit

SFV-CORE/Bandit_OverTheWire

zeroinfiniti/bandit-wargames

ad0x99/linux-4-fun

DenzilFrancisCrasta/bandit

hartikainen/information-theoretic-bandit

jonad/smartcab

victor-iyi/contextual-bandit

vitorhugo13/feup-mssi