Search: "Multi-Armed Bandit"

Showing results 1 - 5 of 13 theses containing the words Multi-Armed Bandit.

  1. Efficient Online Learning under Bandit Feedback

    Authors: Stefan Magureanu; Alexandre Proutiere; Odalric-Ambrym Maillard; KTH
    Keywords: ENGINEERING AND TECHNOLOGY; multi-armed bandits; reinforcement learning; learning to rank; Electrical Engineering

    Abstract: In this thesis we address the multi-armed bandit (MAB) problem with stochastic rewards and correlated arms. In particular, we investigate the case where the expected rewards are a Lipschitz function of the arm, and we extend these results to bandits with arbitrary structure that is known to the decision maker.

  2. Online Combinatorial Optimization under Bandit Feedback

    Authors: Mohammad Sadegh Talebi Mazraeh Shahi; Alexandre Proutiere; Vianney Perchet; KTH
    Keywords: Combinatorial Optimization; Online Learning; Multi-armed Bandits; Sequential Decision Making; Mathematics; Computer Science

    Abstract: Multi-Armed Bandits (MAB) constitute the most fundamental model for sequential decision-making problems with an exploration vs. exploitation trade-off. In such problems, the decision maker selects an arm in each round and observes a realization of the corresponding unknown reward distribution.

  3. Combinatorial Semi-Bandit Methods for Navigation of Electric Vehicles

    Author: Niklas Åkerblom; Chalmers tekniska högskola
    Keywords: NATURAL SCIENCES; energy-efficient navigation; online learning; multi-armed bandit problem; Thompson sampling; combinatorial semi-bandit problem

    Abstract: Climate change is one of the most urgent global challenges humanity is currently facing. As major contributors to greenhouse gas emissions, the transport and automotive sectors have crucial roles to play in solving the problem.

  4. Towards Optimal Algorithms For Online Decision Making Under Practical Constraints

    Author: Aristide Tossou; Chalmers tekniska högskola
    Keywords: NATURAL SCIENCES; ENGINEERING AND TECHNOLOGY; Reinforcement Learning; Multi-Agent Learning; Differential Privacy; Multi-Armed Bandit; Markov Decision Process; Fairness

    Abstract: Artificial Intelligence is increasingly being used in real-life applications such as driving with autonomous cars; deliveries with autonomous drones; customer support with chat-bots; personal assistance with smart speakers . . .

  5. Privacy in the Age of Artificial Intelligence

    Author: Aristide Tossou; Chalmers tekniska högskola
    Keywords: NATURAL SCIENCES; SOCIAL SCIENCES; sequential decision problem; multi-armed bandit; differential privacy

    Abstract: An increasing number of people are using the Internet in their daily lives. Indeed, more than 40% of the world population has access to the Internet, while Facebook (one of the top social networks on the web) is actively used by more than 1.3 billion users each day (Statista 2017).