Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems
Author: Sébastien Bubeck
Publisher:
Total Pages: 137
Release: 2012
Genre: Artificial intelligence
ISBN: 9781601986276


Download Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems Book in PDF, Epub and Kindle

Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that might give higher payoffs in the future. In this monograph, the focus is on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it also analyzes some of the most important variants and extensions, such as the contextual bandit model.


Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
Language: en
Pages: 138
Authors: Sébastien Bubeck
Categories: Computers
Type: BOOK - Published: 2012 - Publisher: Now Pub

GET EBOOK

In this monograph, the focus is on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed
Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems
Language: en
Pages: 137
Authors: Sébastien Bubeck
Categories: Artificial intelligence
Type: BOOK - Published: 2012 - Publisher:

GET EBOOK

Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between
Introduction to Multi-Armed Bandits
Language: en
Pages: 306
Authors: Aleksandrs Slivkins
Categories: Computers
Type: BOOK - Published: 2019-10-31 - Publisher:

GET EBOOK

Multi-armed bandits is a rich, multi-disciplinary area that has been studied since 1933, with a surge of activity in the past 10-15 years. This is the first boo
Algorithmic Learning Theory
Language: en
Pages: 410
Authors: Ricard Gavaldà
Categories: Computers
Type: BOOK - Published: 2009-09-29 - Publisher: Springer

GET EBOOK

This book constitutes the refereed proceedings of the 20th International Conference on Algorithmic Learning Theory, ALT 2009, held in Porto, Portugal, in Octobe
Bandit Algorithms
Language: en
Pages: 537
Authors: Tor Lattimore
Categories: Business & Economics
Type: BOOK - Published: 2020-07-16 - Publisher: Cambridge University Press

GET EBOOK

A comprehensive and rigorous introduction for graduate students and researchers, with applications in sequential decision-making problems.