lilianweng / Multi Armed Bandit
Play with the solutions to the multi-armed-bandit problem.
Stars: ✭ 245
Programming Languages
python
139335 projects - #7 most used programming language
multi-armed-bandit
This repo is set up for a blog post I wrote on "The Multi-Armed Bandit Problem and Its Solutions".
The result of a small experiment on solving a Bernoulli bandit with K = 10 slot machines, each with a randomly initialized reward probability.
- (Left) The plot of time step vs the cumulative regrets.
- (Middle) The plot of true reward probability vs estimated probability.
- (Right) The fraction of each action is picked during the 5000-step run.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].