Advanced search

Found 2 theses matching the above search criteria.

  1. Understanding and Evaluating Policies for Sequential Decision-Making

    Author: Anton Matsson; Chalmers tekniska högskola
    Keywords: NATURAL SCIENCES; Observational data; Sequential decision-making; Reinforcement learning; Off-policy evaluation; Rheumatoid arthritis

    Abstract: Sequential decision-making is a critical component of many complex systems, such as finance, healthcare, and robotics. The long-term goal of a sequential decision-making process is to optimize the policy under which decisions are made.

  2. Efficient Exploration and Robustness in Controlled Dynamical Systems

    Authors: Alessio Russo; Alexandre Proutiere; Henrik Sandberg; Marcello Restelli; Nikolai Matni; Jana Tumova; Mikael Asplund; KTH
    Keywords: ENGINEERING AND TECHNOLOGY; reinforcement learning; efficient exploration; bandit algorithms; adversarial attacks; conformal prediction; data poisoning; Markov decision processes; attack detectability; optimal control; adaptive control; Electrical Engineering; Computer Science; Mathematical Statistics

    Abstract: In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient exploration in multi-task bandit models and model-free exploration in large Markov decision processes (MDPs).