Avancerad sökning
Visar resultat 1 - 5 av 148 avhandlingar som matchar ovanstående sökkriterier.
1. Principal Word Vectors
Sammanfattning : Word embedding is a technique for associating the words of a language with real-valued vectors, enabling us to use algebraic methods to reason about their semantic and grammatical properties. This thesis introduces a word embedding method called principal word embedding, which makes use of principal component analysis (PCA) to train a set of word embeddings for words of a language. LÄS MER
2. Learning based Word Search and Visualisation for Historical Manuscript Images
Sammanfattning : Today, work with historical manuscripts is nearly exclusively done manually, by researchers in the humanities as well as laypeople mapping out their personal genealogy. This is a highly time consuming endeavour as it is not uncommon to spend months with the same volume of a few hundred pages. LÄS MER
3. Document Image Processing for Handwritten Text Recognition : Deep Learning-based Transliteration of Astrid Lindgren’s Stenographic Manuscripts
Sammanfattning : Document image processing and handwritten text recognition have been applied to a variety of materials, scripts, and languages, both modern and historic. They are crucial building blocks in the on-going digitisation efforts of archives, where they aid in preserving archival materials and foster knowledge sharing. LÄS MER
4. Bayesian Models for Multilingual Word Alignment
Sammanfattning : In this thesis I explore Bayesian models for word alignment, how they can be improved through joint annotation transfer, and how they can be extended to parallel texts in more than two languages. In addition to these general methodological developments, I apply the algorithms to problems from sign language research and linguistic typology. LÄS MER
5. De sammansatta ordens accentuering i Skånemålen
Sammanfattning : Swedish has a contrast between two so-called tonal word accents: accent 1 and accent 2. In central standard Swedish, for example, compound words generally have accent 2 and primary stress on the first element. LÄS MER