Sökning: "Reinforcement"

Visar resultat 11 - 15 av 505 avhandlingar innehållade ordet Reinforcement.

11. Reinforcement Learning and Dynamical Systems

Författare :Björn Lindenberg; Karl-Olof Lindahl; Marc G. Bellemare; Linnéuniversitetet; []
Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; artificial intelligence; distributional reinforcement learning; Markov decision processes; Bellman operators; deep learning; multi-armed bandits; Bayesian bandits; conjugate priors; Thompson sampling; linear finite dynamical systems; cycle orbits; fixed-point systems; Mathematics; Matematik; Computer Science; Datavetenskap;

Sammanfattning : This thesis concerns reinforcement learning and dynamical systems in finite discrete problem domains. Artificial intelligence studies through reinforcement learning involves developing models and algorithms for scenarios when there is an agent that is interacting with an environment. LÄS MER
12. Reinforcement Learning for Active Visual Perception

Författare :Aleksis Pirinen; Mathematical Imaging Group; []
Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; computer vision; reinforcement learning; deep learning; active vision; object detection; human pose estimation; semantic segmentation;

Sammanfattning : Visual perception refers to automatically recognizing, detecting, or otherwise sensing the content of an image, video or scene. The most common contemporary approach to tackle a visual perception task is by training a deep neural network on a pre-existing dataset which provides examples of task success and failure, respectively. LÄS MER
13. Regret Minimization in Structured Reinforcement Learning

Författare :Damianos Tranos; Alexandre Proutiere; Yevgeny Seldin; KTH; []
Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Reinforcement Learning; Electrical Engineering; Elektro- och systemteknik;

Sammanfattning : We consider a class of sequential decision making problems in the presence of uncertainty, which belongs to the field of Reinforcement Learning (RL). Specifically, we study discrete Markov decision Processes (MDPs) which model a decision maker or agent that interacts with a stochastic and dynamic environment and receives feedback from it in the form of a reward. LÄS MER
14. Environmental actions on concrete exposed in marine and road environments and its response - Consequences for the initiation of chloride induced reinforcement corrosion

Författare :Anders Lindvall; Chalmers tekniska högskola; []
Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; corrosion; de-icing salt; road conditions; marine conditions; service life predictions; reinforcement; chloride; concrete; moisture conditions; environmental actions;

Sammanfattning : The object of the study presented here has been to describe, further explain and model the influence of the exposure conditions on reinforced concrete structures and the consequences on their expected service life. The focus has been on investigating and quantifying the exposure conditions for structures in marine and road conditions exposed to chloride ions. LÄS MER
15. Sensorimotor Robot Policy Training using Reinforcement Learning

Författare :Ali Ghadirzadeh; Mårten Björkman; Danica Kragic; Atsuto Maki; Ville Kyrki; KTH; []
Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Reinforcement Learning; Artificial Intelligence; Robot Learning; Sensorimotor; Policy Training; Computer Science; Datalogi;

Sammanfattning : Robots are becoming more ubiquitous in our society and taking over many tasks that were previously considered as human hallmarks. Many of these tasks, e.g. LÄS MER

Tidigare 1 2 3 4 5 6 7 Nästa

Sökning: "Reinforcement"

11. Reinforcement Learning and Dynamical Systems

12. Reinforcement Learning for Active Visual Perception

13. Regret Minimization in Structured Reinforcement Learning

14. Environmental actions on concrete exposed in marine and road environments and its response - Consequences for the initiation of chloride induced reinforcement corrosion

15. Sensorimotor Robot Policy Training using Reinforcement Learning

Sökningar just nu

Populära sökningar

Avhandlingar med många visningar igår (2024-04-17)