Avancerad sökning

Visar resultat 1 - 5 av 6 avhandlingar som matchar ovanstående sökkriterier.

  1. 1. Error Handling in Spoken Dialogue Systems : Managing Uncertainty, Grounding and Miscommunication

    Författare :Gabriel Skantze; Rolf Carlson; Alexander Rudnicky; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; speech communications; linguistics; Language technology; Språkteknologi;

    Sammanfattning : Due to the large variability in the speech signal, the speech recognition process constitutes the major source of errors in most spoken dialogue systems. A spoken dialogue system can never know for certain what the user is saying, it can only make hypotheses. LÄS MER

  2. 2. Adaptive Robot Presenters : Modelling Grounding in Multimodal Interaction

    Författare :Agnes Axelsson; Gabriel Skantze; Johan Boye; Elisabeth André; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; NATURVETENSKAP; NATURAL SCIENCES; Human-robot interaction; Dialogue; Presentation; Museum; Grounding; Multimodal; Feedback; Classification; Knowledge graphs; KG; KG-to-text; WebNLG; System; Learning; Large Language Model; LLM; människa-robot-interaktion; hri; dialog; presentation; museum; grundning; multimodal; multimodalitet; återmatning; klassifikation; kunskapsgraf; kg; kg-till-text; data-tilltext; webnlg; system; inlärning; lärande. stor språkmodell; llm; Speech and Music Communication; Tal- och musikkommunikation;

    Sammanfattning : This thesis addresses the topic of grounding in human-robot interaction, that is, the process by which the human and robot can ensure mutual understanding. To explore this topic, the scenario of a robot holding a presentation to a human audience is used, where the robot has to process multimodal feedback from the human in order to adapt the presentation to the human's level of understanding. LÄS MER

  3. 3. Predictive Modeling of Turn-Taking in Spoken Dialogue : Computational Approaches for the Analysis of Turn-Taking in Humans and Spoken Dialogue Systems

    Författare :Erik Ekstedt; Gabriel Skantze; Roger Moore; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; turn-taking; spoken dialog system; human computer interaction; Turtagning; talad dialog; människa-data interaktion; Datalogi; Computer Science; Människa-datorinteraktion; Human-computer Interaction; Speech and Music Communication; Tal- och musikkommunikation;

    Sammanfattning : Turn-taking in spoken dialogue represents a complex cooperative process wherein participants use verbal and non-verbal cues to coordinate who speaks and who listens, to anticipate speaker transitions, and to produce backchannels (e.g., “mhm”, “uh-huh”) at the right places. LÄS MER

  4. 4. Mutual Understanding in Situated Interactions with Conversational User Interfaces : Theory, Studies, and Computation

    Författare :Dimosthenis Kontogiorgos; Joakim Gustafsson; Gabriel Skantze; Catherine Pelachaud; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; human-computer interaction; social robots; smart-speakers; multimodal behaviours; social signal processing; common ground; dialogue and discourse; joint-construction tasks; embodiment; conversational failures; Computer Science; Datalogi;

    Sammanfattning : This dissertation presents advances in HCI through a series of studies focusing on task-oriented interactions between humans and between humans and machines. The notion of mutual understanding is central, also known as grounding in psycholinguistics, in particular how people establish understanding in conversations and what interactional phenomena are present in that process. LÄS MER

  5. 5. Data-driven Methods for Spoken Dialogue Systems : Applications in Language Understanding, Turn-taking, Error Detection, and Knowledge Acquisition

    Författare :Raveesh Meena; Gabriel Skantze; Joakim Gustafson; Helen Hastie; KTH; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Language Understanding; Turn-taking; Error Detection; Knowledge Acquisition; Crowdsourcing; semantisk tolkning talspråk; turtagning i dialogsystem; fel och missförstånd; crowdsourcing; dialogsystem; Tal- och musikkommunikation; Speech and Music Communication;

    Sammanfattning : Spoken dialogue systems are application interfaces that enable humans to interact with computers using spoken natural language. A major challenge for these systems is dealing with the ubiquity of variability—in user behavior, in the performance of the various speech and language processing sub-components, and in the dynamics of the task domain. LÄS MER