Sökning: "Markov decision-processes"

Visar resultat 1 - 5 av 18 avhandlingar innehållade orden Markov decision-processes.

1. Hidden Markov models : Identification, control and inverse filtering

Författare :Robert Mattila; Bo Wahlberg; Eric Moulines; KTH; []
Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; hidden markov models; system identification; method of moments; inverse filtering; abdominal aortic aneurysm; medical; markov decision process; structure; Electrical Engineering; Elektro- och systemteknik;

Sammanfattning : The hidden Markov model (HMM) is one of the workhorse tools in, for example, statistical signal processing and machine learning. It has found applications in a vast number of fields, ranging all the way from bioscience to speech recognition to modeling of user interactions in social networks. LÄS MER
2. Efficient Exploration and Robustness in Controlled Dynamical Systems

Författare :Alessio Russo; Alexandre Proutiere; Henrik Sandberg; Marcello Restelli; Nikolai Matni; Jana Tumova; Mikael Asplund; KTH; []
Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; reinforcement learning; efficient exploration; bandit algorithms; adversarial attacks; conformal prediction; data posioning; markov decision processes; attack detectability; optimal control; adaptive control; Electrical Engineering; Elektro- och systemteknik; Datalogi; Computer Science; Mathematical Statistics; Matematisk statistik;

Sammanfattning : In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient exploration in multi-task bandit models and model-free exploration in large Markov decision processes (MDPs). LÄS MER
3. Reinforcement Learning and Dynamical Systems

Författare :Björn Lindenberg; Karl-Olof Lindahl; Marc G. Bellemare; Linnéuniversitetet; []
Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; artificial intelligence; distributional reinforcement learning; Markov decision processes; Bellman operators; deep learning; multi-armed bandits; Bayesian bandits; conjugate priors; Thompson sampling; linear finite dynamical systems; cycle orbits; fixed-point systems; Mathematics; Matematik; Computer Science; Datavetenskap;

Sammanfattning : This thesis concerns reinforcement learning and dynamical systems in finite discrete problem domains. Artificial intelligence studies through reinforcement learning involves developing models and algorithms for scenarios when there is an agent that is interacting with an environment. LÄS MER
4. Priors and uncertainty in reinforcement learning

Författare :Emilio Jorge; Chalmers tekniska högskola; []
Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; Bayesian reinforcement learning; reinforcement learning; Minimax; Markov decision processes;

Sammanfattning : Handling uncertainty is an important part of decision-making. Leveraging uncertainty for guiding exploration to discover higher rewards has been a standard approach for a long time, using both ad hoc and more principled approaches. LÄS MER
5. The reinforcement learning method : A feasible and sustainable control strategy for efficient occupant-centred building operation in smart cities

Författare :Ross May; Kenneth Carling; Mengjie Han; Pascal Rebreyend; Zoltan Nagy; Högskolan Dalarna; []
Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; NATURVETENSKAP; NATURAL SCIENCES; Markov decision processes; Reinforcement learning; Control; Building; Indoor comfort; Occupant; Complex Systems – Microdata Analysis; Komplexa system - mikrodataanalys;

Sammanfattning : Over half of the world’s population lives in urban areas, a trend which is expected to only grow as we move further into the future. With this increasing trend in urbanisation, challenges are presented in the form of the management of urban infrastructure systems. LÄS MER

Resultatsidor:

1 2 3 4 Nästa