Search: "Markov decision-processes"
Showing results 1 - 5 of 18 theses containing the words Markov decision-processes.
1. Hidden Markov models : Identification, control and inverse filtering
Abstract: The hidden Markov model (HMM) is one of the workhorse tools in, for example, statistical signal processing and machine learning. It has found applications in a vast number of fields, ranging all the way from bioscience to speech recognition to modeling of user interactions in social networks.
2. Efficient Exploration and Robustness in Controlled Dynamical Systems
Abstract: In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient exploration in multi-task bandit models and model-free exploration in large Markov decision processes (MDPs).
3. Reinforcement Learning and Dynamical Systems
Abstract: This thesis concerns reinforcement learning and dynamical systems in finite discrete problem domains. Studying artificial intelligence through reinforcement learning involves developing models and algorithms for scenarios in which an agent interacts with an environment.
4. Priors and uncertainty in reinforcement learning
Abstract: Handling uncertainty is an important part of decision-making. Leveraging uncertainty for guiding exploration to discover higher rewards has been a standard approach for a long time, using both ad hoc and more principled approaches.
5. The reinforcement learning method : A feasible and sustainable control strategy for efficient occupant-centred building operation in smart cities
Abstract: Over half of the world’s population lives in urban areas, a trend which is expected to only grow as we move further into the future. This increasing urbanisation presents challenges for the management of urban infrastructure systems.