Bandit algorithms / Tor Lattimore and Csaba Szepesvari.
"Decision-making in the face of uncertainty is a significant challenge in machine learning, and the multi-armed bandit model is a commonly used framework to address it. This comprehensive and rigorous introduction to the multi-armed bandit problem examines all the major settings, including stoc...
Saved in:
Online Access: |
Full Text (via Cambridge) |
---|---|
Main Authors: | , |
Format: | Electronic eBook |
Language: | English |
Published: |
Cambridge ; New York, NY :
Cambridge University Press,
2020.
|
Subjects: |