Sökning: "document clustering"

Visar resultat 1 - 5 av 8 avhandlingar innehållade orden document clustering.

  1. 1. Clustering in Swedish The Impact of some Properties of the Swedish Language on Document Clustering and an Evaluation Method

    Detta är en avhandling från Stockholm : KTH

    Författare :Magnus Rosell; KTH.; [2005]
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Document Clustering; HUMANITIES and RELIGION Languages and linguistics Linguistic subjects Language technology; HUMANIORA och RELIGIONSVETENSKAP Språkvetenskap Lingvistikämnen Språkteknologi;

    Sammanfattning : Text clustering divides a set of texts into groups, so that texts within each group are similar in content. It may be used to uncover the structure and content of unknown text sets as well as to give new perspectives on known ones. LÄS MER

  2. 2. Under utgivning: den vetenskapliga utgivningens bibliografiska funktion The editors text: bibliographic functions in scholarly editing

    Detta är en avhandling från Göteborg : Göteborg University

    Författare :Mats Dahlström; Högskolan i Borås.; Göteborgs universitet.; Gothenburg University.; [2006]
    Nyckelord :SAMHÄLLSVETENSKAP; SOCIAL SCIENCES; document studies; knowledge organisation; clustering; conceptual analysis; bibliography; media theory; reference works; textual criticism; transposition; Bibliography; Clustering; Conceptual analysis; Document studies; Knowledge organization; Library and information science; Media theory; Reference works; Textual criticism; Transposition;

    Sammanfattning : The thesis investigates in what way the scholarly edition performs bibliographic functions as it manages and positions other documents. This is where the study differs from previous research on scholarly editing and bibliography. LÄS MER

  3. 3. Multi-Document Summarization and Semantic Relatedness

    Detta är en avhandling från Göteborg : Göteborg University

    Författare :Olof Mogren; [2015]
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; multi-document summarization; automatic summarization; semantic similarity; semantic relatedness;

    Sammanfattning : Automatic summarization is the process of presenting the contents of written documents in a short, comprehensive fashion. Many approaches have been proposed for this problem, some of which extract content from the input documents (extractive methods), and others that generate the language in the summary based on some representation of the document contents (abstractive methods). LÄS MER

  4. 4. Automated subject classification of textual web pages, for browsing

    Detta är en avhandling från Digital Information Systems Group, Department of Information Technology, Lund University

    Författare :Koraljka Golub; Lunds universitet.; Lund University.; [2005]
    Nyckelord :SAMHÄLLSVETENSKAP; SOCIAL SCIENCES; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Library and Information Science; Biblioteks- och informationsvetenskap; automated classification; subject browsing; structural Web-page elements; Web page classification; document clustering; bibliographic coupling; text categorization; Subject classification;

    Sammanfattning : With the exponential growth of the World Wide Web, automated subject classification of Web pages has become a major research issue in information and computer sciences. Organizing Web pages into a hierarchical structure for subject browsing is gaining more recognition as an important tool in information-seeking processes. LÄS MER

  5. 5. Gossip-based Algorithms for Information Dissemination and Graph Clustering

    Detta är en avhandling från Stockholm : KTH Royal Institute of Technology

    Författare :Fatemeh Rahimian; KTH.; SICS.; [2014]
    Nyckelord :TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; NATURVETENSKAP; NATURAL SCIENCES;

    Sammanfattning : Decentralized algorithms are becoming ever more prevalent in almost all real-world applications that are either data intensive, computation intensive or both. This thesis presents a few decentralized solutions for large-scale (i) data dissemination, (ii) graph partitioning, and (iii) data disambiguation. LÄS MER