Sökning: "Off-policy evaluation"
Hittade 2 avhandlingar innehållade orden Off-policy evaluation.
1. Understanding and Evaluating Policies for Sequential Decision-Making
Sammanfattning : Sequential-decision making is a critical component of many complex systems, such as finance, healthcare, and robotics. The long-term goal of a sequential decision-making process is to optimize the policy under which decisions are made. LÄS MER
2. Efficient Exploration and Robustness in Controlled Dynamical Systems
Sammanfattning : In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient exploration in multi-task bandit models and model-free exploration in large Markov decision processes (MDPs). LÄS MER
Resultatsidor:
1