Releases · thetawom/mabby · GitHub

04 May 18:14

thetawom

v0.1.2 Release Latest

Latest

What's Changed

fix: use instance rng for bandit play by @ew2664 in #93
fix: Raise ValueError for invalid arm params by @dag2226 in #104
feat: add epsilon-first strategy by @ew2664 in #105

New Contributors

@dag2226 made their first contribution in #104

Full Changelog: v0.1.1...v0.1.2

Contributors

thetawom and dag2226

Assets 2

20 Mar 03:10

thetawom

v0.1.1 Release

mabby is a library for simulating multi-armed bandits, with the following features:

running simulations with different strategies on a configurable set of arms
tracking for regret, cumulative regret, optimality, reward, and cumulative reward metrics
visualizing the tracked metrics, allowing comparison between different strategies

mabby also currently supports and includes:

implementations for epsilon-greedy, UCB1, and beta Thompson sampling strategies
implementations for arms with Bernoulli and Gaussian reward distributions
custom strategies and arms through sub-classing the abstract Strategy and Arm classes

Assets 2