Search: "corpus linguistics"

Showing results 1 - 5 of 138 theses containing the words corpus linguistics.

  1. Morphosyntactic Corpora and Tools for Persian

    Authors: Mojgan Seraji; Joakim Nivre; Carina Jahani; Jan Hajic; Uppsala universitet
    Keywords: NATURAL SCIENCES; Persian; language technology; corpus; treebank; preprocessing; segmentation; part-of-speech tagging; dependency parsing; Computational Linguistics

    Abstract: This thesis presents open source resources in the form of annotated corpora and modules for automatic morphosyntactic processing and analysis of Persian texts. More specifically, the resources consist of an improved part-of-speech tagged corpus and a dependency treebank, as well as tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and dependency parsing for Persian.

  2. The Multilingual Forest : Investigating High-quality Parallel Corpus Development

    Authors: Yvonne Adesam; Martin Volk; Joakim Nivre; Koenraad de Smedt; Stockholms universitet
    Keywords: NATURAL SCIENCES; treebank; syntax; alignment; corpus; annotation projection; multilingual; tagging; parsing; Computational Linguistics

    Abstract: This thesis explores the development of parallel treebanks, collections of language data consisting of texts and their translations, with syntactic annotation and alignment, linking words, phrases, and sentences to show translation equivalence. We describe the semi-manual annotation of the SMULTRON parallel treebank, consisting of 1,000 sentences in English, German and Swedish.

  3. Les verbes de position suédois stå, sitta, ligga et leurs équivalents français : étude contrastive

    Authors: Pauli Kortteinen; Göteborgs universitet
    Keywords: HUMANITIES; Swedish; French; posture verbs; corpus linguistics; contrastive linguistics; parallel corpus; translation corpus; translationese; overuse; underuse; grammaticalisation; lexicalisation; language typology; cognitive linguistics

    Abstract: The Swedish posture verbs stå ‘stand’, sitta ‘sit’ and ligga ‘lie’ are used prototypically to refer to human beings in standing, sitting and lying positions. These polysemous verbs are components of the lexical profile of the Swedish language: they are high-frequency verbs, and alongside their prototypical uses they also have many metaphorical, lexicalised and grammaticalised uses with no straightforward lexical equivalents in French.

  4. Clefts in English and Swedish: A contrastive study of IT-clefts and WH-clefts in original texts and translations

    Authors: Mats Johansson; English
    Keywords: HUMANITIES; English language and literature; information structure; ground; focus; discourse topic; topic; theme; discourse; fronting; wh-clefts; it-clefts; pseudo-cleft constructions; cleft constructions; bidirectional translation corpus; translation; corpus linguistics; contrastive linguistics; Swedish; English; Scandinavian languages and literature; Linguistics

    Abstract: This study investigates the use of cleft constructions in English and Swedish on the basis of a bidirectional translation corpus consisting of original English and Swedish texts and their translations into the other language. This design minimizes the problems inherent in corpora of original texts alone, viz.

  5. Steg för steg. Naturvetenskapligt ämnesspråk som räknas

    Authors: Judy Ribeck; Lars Borin; Emma Sköldberg; Mats Wirén; Göteborgs universitet
    Keywords: HUMANITIES; NATURAL SCIENCES; academic language; computational linguistics; corpus linguistics; language technology; natural language processing; scientific language; subject-specific language; Swedish textbooks; quantitative stylistics; Swedish language; Swedish

    Abstract: In this work, I present a linguistic investigation of the language of Swedish textbooks in the natural sciences, i.e., biology, physics and chemistry. The textbooks, which are used in secondary and upper secondary school, are examined with respect to traditional readability measures, e.