Sökning: "Natural Language Parsing"

Visar resultat 16 - 20 av 38 avhandlingar innehållade orden Natural Language Parsing.

  1. 16. Morphosyntactic Corpora and Tools for Persian

    Författare :Mojgan Seraji; Joakim Nivre; Carina Jahani; Jan Hajic; Uppsala universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Persian; language technology; corpus; treebank; preprocessing; segmentation; part-of-speech tagging; dependency parsing; Computational Linguistics; Datorlingvistik;

    Sammanfattning : This thesis presents open source resources in the form of annotated corpora and modules for automatic morphosyntactic processing and analysis of Persian texts. More specifically, the resources consist of an improved part-of-speech tagged corpus and a dependency treebank, as well as tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and dependency parsing for Persian. LÄS MER

  2. 17. Translation as Linear Transduction : Models and Algorithms for Efficient Learning in Statistical Machine Translation

    Författare :Markus Saers; Joakim Nivre; Anna Sågvall Hein; Dekai Wu; Kevin Knight; Uppsala universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; linear transduction; linear transduction grammar; inversion transduction; zipper finite-state automaton; zipper finite-state transducer; formal language theory; formal transduction theory; translation; automatic translation; machine translation; statistical machine translation; Computational linguistics; Datorlingvistik; Language technology; Språkteknologi; Computational Linguistics; Datorlingvistik;

    Sammanfattning : Automatic translation has seen tremendous progress in recent years, mainly thanks to statistical methods applied to large parallel corpora. Transductions represent a principled approach to modeling translation, but existing transduction classes are either not expressive enough to capture structural regularities between natural languages or too complex to support efficient statistical induction on a large scale. LÄS MER

  3. 18. Multilingual Abstractions: Abstract Syntax Trees and Universal Dependencies

    Författare :Kolachina Prasanth; Göteborgs universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Grammatical Framework; Universal Dependencies; Natural Language Processing; multilinguality; abstract syntax trees; dependency trees; multilingual generation; multilingual parsers;

    Sammanfattning : This thesis studies the connections between parsing friendly representations and interlingua grammars developed for multilingual language generation. Parsing friendly representations refer to dependency tree representations that can be used for robust, accurate and scalable analysis of natural language text. LÄS MER

  4. 19. A novel approach to text classification

    Författare :Niklas Zechner; Johanna Björklund; Efstathios Stamatatos; Umeå universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; Text classification; natural language processing; automata; Computer Science; datalogi;

    Sammanfattning : This thesis explores the foundations of text classification, using both empirical and deductive methods, with a focus on author identification and syntactic methods. We strive for a thorough theoretical understanding of what affects the effectiveness of classification in general. LÄS MER

  5. 20. Heuristisk analys med Diderichsens satsschema - Tillämpningar för svensk text : Heuristic Analysis with Diderichsen’s Sentence Schema – Applications for Swedish Text

    Författare :Kenneth Wilhelmsson; Göteborgs universitet; []
    Nyckelord :HUMANIORA; HUMANITIES; HUMANIORA; HUMANITIES; NATURVETENSKAP; NATURAL SCIENCES; Diderichsens nordiska satsschema; positionsgrammatik; fältgrammatik; licensieringstekniker; Stockholm Umeå Corpus; schemaparsning; rangbaserad chunkning; spetsställning; parafrasgenerering; frågegenerering; naturligt språk-frågesystem; svenska WordNet;

    Sammanfattning : A heuristic method for parsing Swedish text, heuristic schema parsing, is described and implemented. Focusing on main clause (primary) analysis, a collection of licensing techniques for removing non-primary verb candidates is employed, leaving e.g. LÄS MER