Sökning: "word segmentation"

Visar resultat 1 - 5 av 9 avhandlingar innehållade orden word segmentation.

  1. 1. Segmenting and Tagging Text with Neural Networks

    Författare :Yan Shao; Joakim Nivre; Jörg Tiedemann; Christian Hardmeier; Yue Zhang; Uppsala universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; neural networks; sequence labelling; multilinguality; word segmentation; sentence segmentation; morpheme segmentation; transliteration; joint word segmentation and POS tagging;

    Sammanfattning : Segmentation and tagging of text are important preprocessing steps for higher-level natural language processing tasks. In this thesis, we apply a sequence labelling framework based on neural networks to various segmentation and tagging tasks, including sentence segmentation, word segmentation, morpheme segmentation, joint word segmentation and part-of-speech tagging, and named entity transliteration. LÄS MER

  2. 2. Machine-Printed and Handwritten Ethiopic Script Recognition

    Författare :Yaregal Assabie; Chalmers tekniska högskola; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; NATURVETENSKAP; NATURAL SCIENCES; HMM; Handwriting Recognition; OCR; Direction Field Tensor; Ethiopic Character Recognition; Amharic Word Recognition; Structure Tensor;

    Sammanfattning : A written language is represented by using machine-printed or handwritten symbols called characters. For automatic recognition of written languages, handwritten script can be captured offline (by a scanner) and online (by electronic digital devices), whereas machine-printed text is captured offline. LÄS MER

  3. 3. A robust text processing technique applied to lexical error recovery

    Författare :Peter Ingels; Linköpings universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES;

    Sammanfattning : This thesis addresses automatic lexical error recovery and tokenization of corrupt text input. We propose a technique that can automatically correct misspellings, segmentation errors and real-word errors in a unified framework that uses both a model of language production and a model of the typing behavior, and which makes tokenization part of the recovery process. LÄS MER

  4. 4. Articulation Rate and Surprisal in Swedish Child-Directed Speech

    Författare :Johan Sjons; Mats Wirén; Robert Östling; Iris-Corinna Schwarz; Francisco Lacerda; Stockholms universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; articulation rate; information-theoretic surprisal; child-directed speech; final lengthening; spontaneous; longitudinal;

    Sammanfattning : Child-directed speech (CDS) differs from adult-directed speech (ADS) in several respects whose possible facilitating effects for language acquisition are still being studied. One such difference concerns articulation rate --- the number of linguistic units by the number of time units, excluding pauses --- which has been shown to be generally lower than in ADS. LÄS MER

  5. 5. Sounds of silence : Phonological awareness and written language in children with and without speech

    Författare :Janna Ferreira; Jerker Rönnberg; Åsa Wengelin; Stefan Gustafson; Annika Dahlgren Sandberg; Linköpings universitet; []
    Nyckelord :SAMHÄLLSVETENSKAP; SOCIAL SCIENCES; phonological awareness; intervention; motor speech impairment; reading impairment; reading; spelling; literacy; läshandikapp; läsning; stavning; läs- och skrivförmåga; fonologisk medvetenhet; intervention; talhandikapp; Disability research; Handikappsforskning;

    Sammanfattning : Avhandlingens övergripande syfte var att undersöka fonologisk medvetenhet och skriftspråklig förmåga hos talande eller icke-talande barn, med lässvårigheter eller motoriska talsvårigheter. De huvudsakliga fynden i denna avhandling var: (1) För barn med lässvårigheter som befinner sig på en tidig nivå i sin läsutveckling bör intervention kring läs- och skrivförmågor fokusera på barnets svaghet snarare än styrkan vad gäller ordavkodning. LÄS MER