Bandit Algorithms

Tor Lattimore; Csaba Szepesvári

doi:10.1017/9781108571401

Last updated 10th July 2024: Online ordering is currently unavailable due to technical issues. We apologise for any delays responding to customers while we resolve this. For further updates please visit our website https://www.cambridge.org/news-and-insights/technical-incident

Skip to main content Accessibility help

Home
Books
Bandit Algorithms

Bandit Algorithms

- Get access
  
  Buy a print copy
  
  Check if you have access via personal or institutional login
  
  Log in Register
Cited by 439
Cited by
- 439
Crossref Citations

This Book has been cited by the following publications. This list is generated based on data provided by Crossref.

Cheung, Wang Chi Simchi-Levi, David and Zhu, Ruihao 2018. Learning to Optimize Under Non-Stationarity. SSRN Electronic Journal ,

CrossRef

Google Scholar

Jiang, Ray Chiappa, Silvia Lattimore, Tor György, András and Kohli, Pushmeet 2019. Degenerate Feedback Loops in Recommender Systems. p. 383.

CrossRef

Google Scholar

Fouché, Edouard Komiyama, Junpei and Böhm, Klemens 2019. Scaling Multi-Armed Bandit Algorithms. p. 1449.

CrossRef

Google Scholar

Melesko, Jaroslav and Novickij, Vitalij 2019. Computer Adaptive Testing Using Upper-Confidence Bound Algorithm for Formative Assessment. Applied Sciences, Vol. 9, Issue. 20, p. 4303.

CrossRef

Google Scholar

Ghosh, Debamita Verma, Arun and Hanawal, Manjesh K. 2020. Learning and Fairness in Energy Harvesting: A Maximin Multi-Armed Bandits Approach. p. 1.

CrossRef

Google Scholar

Casalé, Balthazar Di Molfetta, Giuseppe Kadri, Hachem and Ralaivola, Liva 2020. Quantum bandits. Quantum Machine Intelligence, Vol. 2, Issue. 1,

CrossRef

Google Scholar

Ouyang, Yi Gagrani, Mukul and Jain, Rahul 2020. Posterior Sampling-Based Reinforcement Learning for Control of Unknown Linear Systems. IEEE Transactions on Automatic Control, Vol. 65, Issue. 8, p. 3600.

CrossRef

Google Scholar

Simchi-Levi, David and Xu, Yunzong 2020. Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability. SSRN Electronic Journal ,

CrossRef

Google Scholar

Garbar, Sergey 2020. Invariant description of UCB strategy for multi-armed bandits for batch processing scenario. p. 75.

CrossRef

Google Scholar

Jia, Huiwen Shi, Cong and Shen, Siqian 2020. Online Learning and Pricing for Service Systems with Reusable Resources. SSRN Electronic Journal ,

CrossRef

Google Scholar

Alcaraz, Juan J. Ayala‐Romero, Jose A. Vales‐Alonso, Javier and Losilla‐López, Fernando 2020. Online reinforcement learning for adaptive interference coordination. Transactions on Emerging Telecommunications Technologies, Vol. 31, Issue. 10,

CrossRef

Google Scholar

Li, Guangxia Lu, Xiao and Niyato, Dusit 2020. A Bandit Approach for Mode Selection in Ambient Backscatter-Assisted Wireless-Powered Relaying. IEEE Transactions on Vehicular Technology, Vol. 69, Issue. 8, p. 9190.

CrossRef

Google Scholar

Rubies-Royo, Vicenc Mazumdar, Eric Dong, Roy Tomlin, Claire and Sastry, S. Shankar 2020. Expert Selection in High-Dimensional Markov Decision Processes. p. 3604.

CrossRef

Google Scholar

Agrawal, Priyank and Tulabandhula, Theja 2020. Multi-Agent Systems and Agreement Technologies. Vol. 12520, Issue. , p. 159.

CrossRef

Google Scholar

Youssef, Marie-Josepha Veeravalli, Venugopal V. Farah, Joumana and Nour, Charbel Abdel 2020. Stochastic Multi-Player Multi-Armed Bandits with Multiple Plays for Uncoordinated Spectrum Access. p. 1.

CrossRef

Google Scholar

Golrezaei, Negin Manshadi, Vahideh Schneider, Jon and Sekar, Shreyas 2020. Learning Product Rankings Robust to Fake Users. SSRN Electronic Journal ,

CrossRef

Google Scholar

Zhang, Haixiang and Zheng, Zeyu 2020. Discrete Convex Simulation Optimization. SSRN Electronic Journal,

CrossRef

Google Scholar

Kolnogorov, Alexander and Grunev, Denis 2020. Exponential Two-Armed Bandit Problem. p. 79.

CrossRef

Google Scholar

Rafieian, Omid and Yoganarasimhan, Hema 2020. How Does Variety of Previous Ads Influence Consumer’s Ad Response?. SSRN Electronic Journal ,

CrossRef

Google Scholar

Li, Chang Feng, Haoyun and Rijke, Maarten de 2020. Cascading Hybrid Bandits: Online Learning to Rank for Relevance and Diversity. p. 33.

CrossRef

Google Scholar

Download full list

Tor Lattimore, University of Alberta, Csaba Szepesvári, University of Alberta

Publisher:: Cambridge University Press
Online publication date:: July 2020
Print publication year:: 2020
Online ISBN:: 9781108571401
DOI:: https://doi.org/10.1017/9781108571401

Subjects:: Engineering, Computer Science, Pattern Recognition and Machine Learning, Control Systems and Optimisation

Information

Contents

Metrics

Decision-making in the face of uncertainty is a significant challenge in machine learning, and the multi-armed bandit model is a commonly used framework to address it. This comprehensive and rigorous introduction to the multi-armed bandit problem examines all the major settings, including stochastic, adversarial, and Bayesian frameworks. A focus on both mathematical intuition and carefully worked proofs makes this an excellent reference for established researchers and a helpful resource for graduate students in computer science, engineering, statistics, applied mathematics and economics. Linear bandits receive special attention as one of the most useful models in applications, while other chapters are dedicated to combinatorial bandits, ranking, non-stationary problems, Thompson sampling and pure exploration. The book ends with a peek into the world beyond bandits with an introduction to partial monitoring and learning in Markov decision processes.

'This year marks the 68th anniversary of ‘multi-armed bandits’ introduced by Herbert Robbins in 1952, and the 35th anniversary of his 1985 paper with me that advanced multi-armed bandit theory in new directions via the concept of ‘regret’ and a sharp asymptotic lower bound for the regret. This vibrant subject has attracted important multidisciplinary developments and applications. Bandit Algorithms gives it a comprehensive and up-to-date treatment, and meets the need for such books in instruction and research in the subject, as in a new course on contextual bandits and recommendation technology that I am developing at Stanford.'

Tze L. Lai - Stanford University

'This is a timely book on the theory of multi-armed bandits, covering a very broad range of basic and advanced topics. The rigorous treatment combined with intuition makes it an ideal resource for anyone interested in the mathematical and algorithmic foundations of a fascinating and rapidly growing field of research.'

Nicolò Cesa-Bianchi - University of Milan

'The field of bandit algorithms, in its modern form, and driven by prominent new applications, has been taking off in multiple directions. The book by Lattimore and Szepesvári is a timely contribution that will become a standard reference on the subject. The book offers a thorough exposition of an enormous amount of material, neatly organized in digestible pieces. It is mathematically rigorous, but also pleasant to read, rich in intuition and historical notes, and without superfluous details. Highly recommended.'

John Tsitsiklis - Massachusetts Institute of Technology

Metrics

Altmetric attention score

Total number of HTML views: 0

Total number of PDF views: 0 *

Loading metrics...

Total views: 0 *

Loading metrics...

* Views captured on Cambridge Core between #date#. This data will be updated every 24 hours.

Usage data cannot currently be displayed.

Bandit Algorithms

Book description

Reviews

Refine List

Actions for selected content:

Save Search

Contents

Metrics

Altmetric attention score

Full text views

Book summary page views