Big data networks and orthology analysis

Sammanfattning: Understanding biological systems in complex organisms is important in life science in order to comprehend the interplay of genes, proteins, and compounds causing complex diseases. As biological systems are intricate, bioinformatics tools, models, and algorithms are of the utmost importance to understand the bigger picture and decipher biological meaning from the vast amounts of information available from biological experiments and predictions. Bioinformatics programs and algorithms do not only depend on information from experiments, but also on information generated from other tools in order to draw accurate conclusions and make predictions. Prediction of orthologs, genes having a common ancestry, separated by a speciation event, are important building blocks for a wide variety of tools and analysis pipelines, as they can be used to transfer gene function between species. Orthologs can for example be used to map genes of model organisms to genes in humans in studies of drug targets. They are extensively used in functional association networks in order to transfer information between species. Functional association networks are models of associations between genes or proteins, where associations can be derived from experimental evidence of different types, from the species itself, or transferred from other species using orthologs. The networks can be used to explore the context and neighbors of a gene, but also for a variety of higher-level analyses, e.g. network-based pathway enrichment analysis. In pathway enrichment analysis the networks can be utilized to contextualize experimental gene sets and annotate them with biological functions. As these tools depend on each other, it is of great importance that the networks used in pathway enrichment analysis are comprehensive and accurate, and that the orthologs used in the networks are relevant and significant. In this thesis, the development and improvement of five bioinformatics tools within three areas of bioinformatics are presented. Despite the tools residing within slightly different areas, they all rely on each other, and can all on different levels improve our understanding of biological functions and biological meaning, from the level of orthology analysis to functional association networks to pathway enrichment analysis.

  KLICKA HÄR FÖR ATT SE AVHANDLINGEN I FULLTEXT. (PDF-format)