Avancerad sökning

Visar resultat 1 - 5 av 19 avhandlingar som matchar ovanstående sökkriterier.

  1. 1. Why the pond is not outside the frog? Grounding in contextual representations by neural language models

    Författare :Mehdi Ghanimifard; Göteborgs universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Computational linguistics; Language grounding; Spatial language; Distributional semantics; Computer vision; Language modelling; Vision and language; Neural language model; Grounded language model;

    Sammanfattning : In this thesis, to build a multi-modal system for language generation and understanding, we study grounded neural language models. Literature in psychology informs us that spatial cognition involves different aspects of knowledge that include visual perception and human interaction with the world. LÄS MER

  2. 2. Adaptive Robot Presenters : Modelling Grounding in Multimodal Interaction

    Författare :Agnes Axelsson; Gabriel Skantze; Johan Boye; Elisabeth André; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; NATURVETENSKAP; NATURAL SCIENCES; Human-robot interaction; Dialogue; Presentation; Museum; Grounding; Multimodal; Feedback; Classification; Knowledge graphs; KG; KG-to-text; WebNLG; System; Learning; Large Language Model; LLM; människa-robot-interaktion; hri; dialog; presentation; museum; grundning; multimodal; multimodalitet; återmatning; klassifikation; kunskapsgraf; kg; kg-till-text; data-tilltext; webnlg; system; inlärning; lärande. stor språkmodell; llm; Speech and Music Communication; Tal- och musikkommunikation;

    Sammanfattning : This thesis addresses the topic of grounding in human-robot interaction, that is, the process by which the human and robot can ensure mutual understanding. To explore this topic, the scenario of a robot holding a presentation to a human audience is used, where the robot has to process multimodal feedback from the human in order to adapt the presentation to the human's level of understanding. LÄS MER

  3. 3. A Picture is Worth a Thousand Words: Natural Language Processing in Context

    Författare :Lovisa Hagström; Chalmers tekniska högskola; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; NLP; Knowledge representation; Neural network; Vision-and-language models; Grounding; BERT;

    Sammanfattning : Modern NLP models learn language from lexical co-occurrences. While this method has allowed for significant breakthroughs, it has also exposed potential limitations of modern NLP methods. For example, NLP models are prone to hallucinate, represent a biased world view and may learn spurious correlations to solve the data instead of the task at hand. LÄS MER

  4. 4. Error Handling in Spoken Dialogue Systems : Managing Uncertainty, Grounding and Miscommunication

    Författare :Gabriel Skantze; Rolf Carlson; Alexander Rudnicky; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; speech communications; linguistics; Language technology; Språkteknologi;

    Sammanfattning : Due to the large variability in the speech signal, the speech recognition process constitutes the major source of errors in most spoken dialogue systems. A spoken dialogue system can never know for certain what the user is saying, it can only make hypotheses. LÄS MER

  5. 5. Teaching mobile robots to use spatial words

    Författare :Simon Dobnik; Göteborgs universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; situated interaction; spatial descriptions; language and perception; perceptual grounding; dialogue; natural language semantics; machine learning;

    Sammanfattning : The meaning of spatial words can only be evaluated by establishing a reference to the properties of the environment in which the word is used. For example, in order to evaluate what is to the left of something or how fast is fast in a given context, we need to evaluate properties such as the position of objects in the scene, their typical function and behaviour, the size of the scene and the perspective from which the scene is viewed. LÄS MER