Sökning: "Joakim Nivre"
Visar resultat 26 - 29 av 29 avhandlingar innehållade orden Joakim Nivre.
26. Text Harmonization Strategies for Phrase-Based Statistical Machine Translation
Sammanfattning : In this thesis I aim to improve phrase-based statistical machine translation (PBSMT) in a number of ways by the use of text harmonization strategies. PBSMT systems are built by training statistical models on large corpora of human translations. This architecture generally performs well for languages with similar structure. LÄS MER
27. Understanding Neural Machine Translation : An investigation into linguistic phenomena and attention mechanisms
Sammanfattning : In this thesis, I explore neural machine translation (NMT) models via targeted investigation of various linguistic phenomena and thorough exploration of the internal structure of NMT models, in particular the attention mechanism. With respect to linguistic phenomena, I explore the ability of NMT models to translate ambiguous words, to learn long-range dependencies, to learn morphology, and to translate negation—linguistic phenomena that have been challenging for the older paradigm of statistical machine translation. LÄS MER
28. Recycling Translations : Extraction of Lexical Data from Parallel Corpora and their Application in Natural Language Processing
Sammanfattning : The focus of this thesis is on re-using translations in natural language processing. It involves the collection of documents and their translations in an appropriate format, the automatic extraction of translation data, and the application of the extracted data to different tasks in natural language processing. LÄS MER
29. Predicting Linguistic Structure with Incomplete and Cross-Lingual Supervision
Sammanfattning : Contemporary approaches to natural language processing are predominantly based on statistical machine learning from large amounts of text, which has been manually annotated with the linguistic structure of interest. However, such complete supervision is currently only available for the world's major languages, in a limited number of domains and for a limited range of tasks. LÄS MER