Skip to content

Releases: thetawom/mabby

v0.1.2 Release

04 May 18:14
Compare
Choose a tag to compare

What's Changed

  • fix: use instance rng for bandit play by @ew2664 in #93
  • fix: Raise ValueError for invalid arm params by @dag2226 in #104
  • feat: add epsilon-first strategy by @ew2664 in #105

New Contributors

Full Changelog: v0.1.1...v0.1.2

v0.1.1 Release

20 Mar 03:10
Compare
Choose a tag to compare

mabby is a library for simulating multi-armed bandits, with the following features:

  • running simulations with different strategies on a configurable set of arms
  • tracking for regret, cumulative regret, optimality, reward, and cumulative reward metrics
  • visualizing the tracked metrics, allowing comparison between different strategies

mabby also currently supports and includes:

  • implementations for epsilon-greedy, UCB1, and beta Thompson sampling strategies
  • implementations for arms with Bernoulli and Gaussian reward distributions
  • custom strategies and arms through sub-classing the abstract Strategy and Arm classes