Sökning: "treebank parsing"

Hittade 5 avhandlingar innehållade orden treebank parsing.

  1. 1. Inductive Dependency Parsing of Natural Language Text

    Författare :Joakim Nivre; Walter Daelemans; Växjö universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; natural language parsing; dependency parsing; memory-based learning; treebank parsing; Systems engineering; Systemteknik; Computer and Information Sciences Computer Science; Data- och informationsvetenskap;

    Sammanfattning : This thesis investigates new methods for syntactic parsing of unrestricted natural language text under requirements of robustness and disambiguation. A parsing system is required to assign to every sentence in a text at least one analysis (robustness) and at most one analysis (disambiguation). LÄS MER

  2. 2. The Multilingual Forest : Investigating High-quality Parallel Corpus Development

    Författare :Yvonne Adesam; Martin Volk; Joakim Nivre; Koenraad de Smedt; Stockholms universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; treebank; syntax; alignment; corpus; annotation projection; multilingual; tagging; parsing; datorlingvistik; Computational Linguistics;

    Sammanfattning : This thesis explores the development of parallel treebanks, collections of language data consisting of texts and their translations, with syntactic annotation and alignment, linking words, phrases, and sentences to show translation equivalence. We describe the semi-manual annotation of the SMULTRON parallel treebank, consisting of 1,000 sentences in English, German and Swedish. LÄS MER

  3. 3. Morphosyntactic Corpora and Tools for Persian

    Författare :Mojgan Seraji; Joakim Nivre; Carina Jahani; Jan Hajic; Uppsala universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Persian; language technology; corpus; treebank; preprocessing; segmentation; part-of-speech tagging; dependency parsing; Computational Linguistics; Datorlingvistik;

    Sammanfattning : This thesis presents open source resources in the form of annotated corpora and modules for automatic morphosyntactic processing and analysis of Persian texts. More specifically, the resources consist of an improved part-of-speech tagged corpus and a dependency treebank, as well as tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and dependency parsing for Persian. LÄS MER

  4. 4. Tree Transformations in Inductive Dependency Parsing

    Författare :Jens Nilsson; Joakim Nivre; Pierre Nugues; Växjö universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Inductive Dependency Parsing; Dependency Structure; Tree Transformation; Non-projectivity; Coordination; Verb Group; Language technology; Språkteknologi; Computer and Information Sciences Computer Science; Data- och informationsvetenskap;

    Sammanfattning : This licentiate thesis deals with automatic syntactic analysis, or parsing, of natural languages. A parser constructs the syntactic analysis, which it learns by looking at correctly analyzed sentences, known as training data. The general topic concerns manipulations of the training data in order to improve the parsing accuracy. LÄS MER

  5. 5. Tree Transformations in Inductive Dependency Parsing

    Författare :Jens Nilsson; Joakim Nivre; Pierre Nugues; Växjö universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Inductive Dependency Parsing; Dependency Structure; Tree Transformation; Non-projectivity; Coordination; Verb Group; Language technology; Språkteknologi; Computer and Information Sciences Computer Science; Data- och informationsvetenskap;

    Sammanfattning : This licentiate thesis deals with automatic syntactic analysis, or parsing, of natural languages. A parser constructs the syntactic analysis, which it learns by looking at correctly analyzed sentences, known as training data. The general topic concerns manipulations of the training data in order to improve the parsing accuracy. LÄS MER