contextual-bandits

A comprehensive Python library implementing a variety of contextual and non-contextual multi-armed bandit algorithms, including LinUCB, Epsilon-Greedy, Upper Confidence Bound (UCB), Thompson Sampling, KernelUCB, NeuralLinearBandit, and DecisionTreeBandit, designed for reinforcement learning applications.
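
As a point of reference, disjoint LinUCB (one of the algorithms listed above) keeps per-arm ridge-regression statistics and adds an exploration bonus proportional to the model's uncertainty in the current context. The sketch below is a minimal, self-contained illustration of that idea only; it is not the library's actual API, and the `LinUCB` class name and its `select`/`update` methods are assumptions made for demonstration.

```python
import numpy as np

class LinUCB:
    """Minimal disjoint LinUCB sketch: one ridge-regression model per arm plus a UCB bonus.
    Illustrative only; not the interface exposed by this library."""

    def __init__(self, n_arms, n_features, alpha=1.0):
        self.alpha = alpha
        # Per-arm Gram matrix A = I + sum(x x^T) and response vector b = sum(r * x).
        self.A = [np.eye(n_features) for _ in range(n_arms)]
        self.b = [np.zeros(n_features) for _ in range(n_arms)]

    def select(self, context):
        """Return the arm with the highest upper confidence bound for this context."""
        scores = []
        for A, b in zip(self.A, self.b):
            A_inv = np.linalg.inv(A)
            theta = A_inv @ b                                   # ridge-regression estimate
            bonus = self.alpha * np.sqrt(context @ A_inv @ context)
            scores.append(theta @ context + bonus)
        return int(np.argmax(scores))

    def update(self, arm, context, reward):
        """Fold the observed reward into the chosen arm's statistics."""
        self.A[arm] += np.outer(context, context)
        self.b[arm] += reward * context


# Toy usage: three arms, five-dimensional contexts, synthetic rewards.
rng = np.random.default_rng(0)
bandit = LinUCB(n_arms=3, n_features=5, alpha=1.0)
for _ in range(100):
    x = rng.normal(size=5)
    arm = bandit.select(x)
    bandit.update(arm, x, reward=rng.binomial(1, 0.5))
```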

Primary Language: Python
License: GNU General Public License v3.0 (GPL-3.0)
