  1. Mining Speech Sounds : Machine Learning Methods for Automatic Speech Recognition and Analysis

    Authors: Giampiero Salvi; Björn Granström; Torbjørn Svendsen; KTH
    Keywords: NATURAL SCIENCES; speech; machine learning; data mining; signal processing; Computer science

    This thesis collects studies on machine learning methods applied to speech technology and speech research problems. The six research papers included in this thesis are organised in three main areas. The first group of studies was carried out within the European project Synface.

  2. Estimation of Speaker Age : Effects of Speech Properties and Speech Material

    Authors: Sara Skoog Waller; Mårten Eriksson; Billy Jansson; Mittuniversitetet
    Keywords: SOCIAL SCIENCES; Age estimation; Voice perception; Speech properties; Speech rate; Vocal disguise; Age disguise; Accuracy; Confidence; Spontaneous speech

    The aim of this thesis was to investigate factors related to accuracy in estimation of speaker age and the role of certain speech properties in perception and manipulation of speaker age, as well as their interaction with the speech material that the age estimates were based on. This thesis consists of three studies.

  3. The /k/s, the /t/s, and the inbetweens : Novel approaches to examining the perceptual consequences of misarticulated speech

    Authors: Sofia Strömbergsson; David House; Åsa Wengelin; Benjamin Munson; KTH
    Keywords: HUMANITIES; speech perception; speech disorders; speech synthesis; speech analysis; Speech and Music Communication

    This thesis comprises investigations of the perceptual consequences of children's misarticulated speech – as perceived by clinicians, by everyday listeners, and by the children themselves. By inviting methods from other areas to the study of speech disorders, this work demonstrates some successful cases of cross-fertilization.

  4. Disfluency in Swedish human–human and human–machine travel booking dialogues

    Authors: Robert Eklund; Lars Ahrenberg; Linköpings universitet
    Keywords: NATURAL SCIENCES; Speech; Speech disorders; Speech intelligibility; Speech perception; Computer linguistics; phonetic; Man-computer-interaction; Computational linguistics

    This thesis studies disfluency in spontaneous Swedish speech, i.e., the occurrence of hesitation phenomena such as eh, öh, truncated words, repetitions and repairs, mispronunciations and so on.

  5. Semantic Framing of Speech : Emotional and Topical Cues in Perception of Poorly Specified Speech

    Authors: Björn Lidestam; Björn Lyxell; Staffan Hygge; Linköpings universitet
    Keywords: SOCIAL SCIENCES; Speech perception; speechreading; facial expressions; priming; phonemes; semantics; lipreading; auditory perception; cognition; paralinguistics; emotional content; Psychology

    The general aim of this thesis was to test the effects of paralinguistic (emotional) and prior contextual (topical) cues on perception of poorly specified visual, auditory, and audiovisual speech. The specific purposes were to (1) examine whether facially displayed emotions can facilitate speechreading performance; (2) study the mechanism for such facilitation; (3) map information-processing factors that are involved in processing of poorly specified speech; and (4) present a comprehensive conceptual framework for speech perception, with specification of the signal being considered.