Sökning: "Reinforcement"

Visar resultat 11 - 15 av 505 avhandlingar innehållade ordet Reinforcement.

  1. 11. Reinforcement Learning and Dynamical Systems

    Författare :Björn Lindenberg; Karl-Olof Lindahl; Marc G. Bellemare; Linnéuniversitetet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; artificial intelligence; distributional reinforcement learning; Markov decision processes; Bellman operators; deep learning; multi-armed bandits; Bayesian bandits; conjugate priors; Thompson sampling; linear finite dynamical systems; cycle orbits; fixed-point systems; Mathematics; Matematik; Computer Science; Datavetenskap;

    Sammanfattning : This thesis concerns reinforcement learning and dynamical systems in finite discrete problem domains. Artificial intelligence studies through reinforcement learning involves developing models and algorithms for scenarios when there is an agent that is interacting with an environment. LÄS MER

  2. 12. Reinforcement Learning for Active Visual Perception

    Författare :Aleksis Pirinen; Mathematical Imaging Group; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; computer vision; reinforcement learning; deep learning; active vision; object detection; human pose estimation; semantic segmentation;

    Sammanfattning : Visual perception refers to automatically recognizing, detecting, or otherwise sensing the content of an image, video or scene. The most common contemporary approach to tackle a visual perception task is by training a deep neural network on a pre-existing dataset which provides examples of task success and failure, respectively. LÄS MER

  3. 13. Regret Minimization in Structured Reinforcement Learning

    Författare :Damianos Tranos; Alexandre Proutiere; Yevgeny Seldin; KTH; []
    Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Reinforcement Learning; Electrical Engineering; Elektro- och systemteknik;

    Sammanfattning : We consider a class of sequential decision making problems in the presence of uncertainty, which belongs to the field of Reinforcement Learning (RL). Specifically, we study discrete Markov decision Processes (MDPs) which model a decision maker or agent that interacts with a stochastic and dynamic environment and receives feedback from it in the form of a reward. LÄS MER

  4. 14. Environmental actions on concrete exposed in marine and road environments and its response - Consequences for the initiation of chloride induced reinforcement corrosion

    Författare :Anders Lindvall; Chalmers tekniska högskola; []
    Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; corrosion; de-icing salt; road conditions; marine conditions; service life predictions; reinforcement; chloride; concrete; moisture conditions; environmental actions;

    Sammanfattning : The object of the study presented here has been to describe, further explain and model the influence of the exposure conditions on reinforced concrete structures and the consequences on their expected service life. The focus has been on investigating and quantifying the exposure conditions for structures in marine and road conditions exposed to chloride ions. LÄS MER

  5. 15. Sensorimotor Robot Policy Training using Reinforcement Learning

    Författare :Ali Ghadirzadeh; Mårten Björkman; Danica Kragic; Atsuto Maki; Ville Kyrki; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Reinforcement Learning; Artificial Intelligence; Robot Learning; Sensorimotor; Policy Training; Computer Science; Datalogi;

    Sammanfattning : Robots are becoming more ubiquitous in our society and taking over many tasks that were previously considered as human hallmarks. Many of these tasks, e.g. LÄS MER