Sökning: "Joakim Nivre"
Visar resultat 21 - 25 av 29 avhandlingar innehållade orden Joakim Nivre.
21. Spelling Normalisation and Linguistic Analysis of Historical Text for Information Extraction
Sammanfattning : Historical text constitutes a rich source of information for historians and other researchers in humanities. Many texts are however not available in an electronic format, and even if they are, there is a lack of NLP tools designed to handle historical text. LÄS MER
22. Translation as Linear Transduction : Models and Algorithms for Efficient Learning in Statistical Machine Translation
Sammanfattning : Automatic translation has seen tremendous progress in recent years, mainly thanks to statistical methods applied to large parallel corpora. Transductions represent a principled approach to modeling translation, but existing transduction classes are either not expressive enough to capture structural regularities between natural languages or too complex to support efficient statistical induction on a large scale. LÄS MER
23. Morphosyntactic Corpora and Tools for Persian
Sammanfattning : This thesis presents open source resources in the form of annotated corpora and modules for automatic morphosyntactic processing and analysis of Persian texts. More specifically, the resources consist of an improved part-of-speech tagged corpus and a dependency treebank, as well as tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and dependency parsing for Persian. LÄS MER
24. Segmenting and Tagging Text with Neural Networks
Sammanfattning : Segmentation and tagging of text are important preprocessing steps for higher-level natural language processing tasks. In this thesis, we apply a sequence labelling framework based on neural networks to various segmentation and tagging tasks, including sentence segmentation, word segmentation, morpheme segmentation, joint word segmentation and part-of-speech tagging, and named entity transliteration. LÄS MER
25. Compound Processing for Phrase-Based Statistical Machine Translation
Sammanfattning : In this thesis I explore how compound processing can be used to improve phrase-based statistical machine translation (PBSMT) between English and German/Swedish. Both German and Swedish generally use closed compounds, which are written as one word without spaces or other indicators of word boundaries. LÄS MER