GitHub

This is the "Data Stream Mining" project of Yuyan Zhao and Bérénice Jaulmes.

Documentation

bayes_ucb.py

It implements the BayesUCB algorithm^[1] for the multi-armed bandit problem with the River library. We chose to use a Beta distribution to compute the posterior distribution, and use the p-th quantile as the upper confidence bound (UCB) for each arm. The arm with the highest UCB is then pulled. And the posterior distribution for the pulled arm is updated.

test.py

It evaluates the performance of the BayesUCB policy defined, and compares it with the existing bandit algorithms in River. The result is shown in the figure below:

/previous_version

Our current code has been reorganized by Max via Pull requests to the River. The previous versions are also available at /previous_version

Reference

[1] Kaufmann, Emilie, Olivier Cappé, and Aurélien Garivier. "On Bayesian upper confidence bounds for bandit problems." Artificial intelligence and statistics. PMLR, 2012.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
previous version		previous version
bayes_ucb.py		bayes_ucb.py
readme.md		readme.md
result.png		result.png
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Documentation

Reference

About

Releases

Packages

Contributors 2

Languages

ormarv/Project

Folders and files

Latest commit

History

Repository files navigation

Documentation

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages