Sökning: "Lipschitz Bandits"
Hittade 2 avhandlingar innehållade orden Lipschitz Bandits.
1. Structured Stochastic Bandits
Sammanfattning : In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. Particularly, we investigate the case when the expected rewards are a Lipschitz function of the arm, and the learning to rank problem, as viewed from a MAB perspective. LÄS MER
2. Efficient Online Learning under Bandit Feedback
Sammanfattning : In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. Particularly, we investigate the case when the expected rewards are a Lipschitz function of the arm and extend these results to bandits with arbitrary structure that is known to the decision maker. LÄS MER