jax-bandits
PublicEpsilon-greedy, UCB and Thomson sampling for Bernoulli rewards in pure jitted Jax
Creat:2024-10-01T19:12:16
Update:2025-02-23T17:21:24
1
Stars
0
Stars Increase
Epsilon-greedy, UCB and Thomson sampling for Bernoulli rewards in pure jitted Jax