Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
Primary LanguageGoApache License 2.0Apache-2.0