Search: "Yevgeny Seldin"
Found 1 thesis containing the words Yevgeny Seldin.
1. Regret Minimization in Structured Reinforcement Learning
Abstract: We consider a class of sequential decision-making problems under uncertainty, which belongs to the field of Reinforcement Learning (RL). Specifically, we study discrete Markov decision processes (MDPs), which model a decision maker, or agent, that interacts with a stochastic and dynamic environment and receives feedback from it in the form of a reward.