Search: "spelling normalisation"

Found 2 dissertations containing the words spelling normalisation.

  1. Spelling Normalisation and Linguistic Analysis of Historical Text for Information Extraction

    Author: Eva Pettersson; Joakim Nivre; Beáta Megyesi; Michael Piotrowski; Uppsala universitet
    Keywords: NLP for historical text; spelling normalisation; digital humanities; information extraction; character-based statistical machine translation; SMT; Levenshtein edit distance; language technology; computational linguistics; Computational Linguistics; Datorlingvistik

    Abstract: Historical text constitutes a rich source of information for historians and other researchers in humanities. Many texts are however not available in an electronic format, and even if they are, there is a lack of NLP tools designed to handle historical text.

  2. Natural Language Processing for Low-resourced Code-switched Colloquial Languages – The Case of Algerian Language

    Author: Wafia Adouane; Göteborgs universitet
    Keywords: NATURVETENSKAP; NATURAL SCIENCES; Natural language processing; Deep neural networks; Low-resourced language; Colloquial language; Code-switch; Dialectal Arabic; User-generated data; Non-standardised orthography; Algerian language

    Abstract: In this thesis we explore to what extent deep neural networks (DNNs), trained end-to-end, can be used to perform natural language processing tasks for code-switched colloquial languages lacking both large automated data and processing tools, for instance tokenisers, morpho-syntactic and semantic parsers, etc. We opt for an end-to-end learning approach because this kind of data is hard to control due to its high orthographic and linguistic variability.