We're blogging machines!
Subscribe to feed
‹ 4 x 4 Minesweeper as an MDP • Optimal coding ›
July 19, 2012 in Multi-Armed Bandit Problem by hundalhh | Permalink
Several multi-armed bandit algorithms with logarithmic regret.