Search: "extract corpus"

Showing results 1 - 5 of 11 dissertations containing the words extract corpus.

  1. Resource Lean and Portable Automatic Text Summarization

    Authors: Martin Hassel; Hercules Dalianis; Viggo Kann; Kerstin Severinson Eklundh; Horacio Saggion; KTH
    Keywords: NATURAL SCIENCES; holsum; language independent; holistic; summarization; lexical semantics; co-occurrence statistics; word space model; bag-of-words; bag-of-concepts; random indexing; swesum; news corpus; extract corpus; Computer science

    Abstract: Today, with digitally stored information available in abundance, even for many minor languages, this information must by some means be filtered and extracted in order to avoid drowning in it. Automatic summarization is one such technique, where a computer summarizes a longer text to a shorter non-redundant form.

  2. Steg för steg. Naturvetenskapligt ämnesspråk som räknas : Step by step. A computational analysis of Swedish textbook language

    Authors: Judy Carola Ribeck; Göteborgs universitet
    Keywords: HUMANITIES; SOCIAL SCIENCES; NATURAL SCIENCES; academic language; computational linguistics; corpus linguistics; language technology; natural language processing; scientific language; subject-specific language; Swedish textbooks; quantitative stylistics

    Abstract: In this work, I present a linguistic investigation of the language of Swedish textbooks in the natural sciences, i.e., biology, physics and chemistry. The textbooks, which are used in secondary and upper secondary school, are examined with respect to traditional readability measures, e.

  3. Steg för steg. Naturvetenskapligt ämnesspråk som räknas

    Authors: Judy Ribeck; Lars Borin; Emma Sköldberg; Mats Wirén; Göteborgs universitet
    Keywords: HUMANITIES; NATURAL SCIENCES; academic language; computational linguistics; corpus linguistics; language technology; natural language processing; scientific language; subject-specific language; Swedish textbooks; quantitative stylistics; Swedish language; Swedish

    Abstract: In this work, I present a linguistic investigation of the language of Swedish textbooks in the natural sciences, i.e., biology, physics and chemistry. The textbooks, which are used in secondary and upper secondary school, are examined with respect to traditional readability measures, e.

  4. Building Knowledge Graphs : Processing Infrastructure and Named Entity Linking

    Authors: Marcus Klang; Robotik och Semantiska System
    Keywords: NATURAL SCIENCES; natural language processing; machine learning; computational linguistics; named entity linking

    Abstract: Things such as organizations, persons, or locations are ubiquitous in all texts circulating on the internet, particularly in the news, forum posts, and social media. Today, there is more written material than any single person can read through during a typical lifespan.

  5. Extracting Clinical Findings from Swedish Health Record Text

    Authors: Maria Skeppstedt; Hercules Dalianis; Gunnar Nilsson; Maria Kvist; Tapio Salakoski; Stockholms universitet
    Keywords: SOCIAL SCIENCES; Named entity recognition; Corpora development; Clinical text processing; Distributional semantics; Random indexing; Vocabulary expansion; Assertion classification; Clinical text mining; Electronic health records; Swedish; Computer and Systems Sciences

    Abstract: Information contained in the free text of health records is useful for the immediate care of patients as well as for medical knowledge creation. Advances in clinical language processing have made it possible to automatically extract this information, but most research has, until recently, been conducted on clinical text written in English.