Methods and Applications in Comparative Bacterial Genomics

Detta är en avhandling från Uppsala : Acta Universitatis Upsaliensis

Författare: Hans-henrik Fuxelius; Uppsala Universitet.; [2007]

Nyckelord: Biology; Biologi;

Sammanfattning: Comparative studies of bacterial genomes, now counting in the hundreds, generate massive amounts of information. In order to support a systematic and efficient approach to genomic analyses, a database driven system with graphic visualization of genomic properties was developed - GenComp. The software was applied to studies of obligate intracellular bacteria. In all studies, ORFs were extracted and grouped into ORF-families. Based on gene order synteny, orthologous clusters of core genes and variable spacer ORFs were identified and extracted for alignments and computation of substitution frequencies.The software was applied to the genomes of six Chlamydia trachomatis strains to identify the most rapidly evolving genes. Five genes were chosen for genotyping, and close to a 3-fold higher discrimination capacity was achieved than that of serotypes.With GenComp as the backbone, a massive comparative analysis were performed on the variable gene set in the Rickettsiaceae, which includes Rickettsia prowazekii and Orientia tsutsugamushi, the agents of epidemic and scrub typhus, respectively. O. tsutsugamushi has the most exceptional bacterial genome identified to date; the 2.2 Mb genome is 200-fold more repeated than the 1.1 Mb R. prowazekii genome due to an extensive proliferation of conjugative type IV secretion systems and associated genes. GenComp identified 688 core genes that are conserved across 7 closely related Rickettsia genomes along with a set of 469 variably present genes with homologs in other species. The analysis indicates that up to 70% of the extensively degraded and variably present genes represent mobile genetic elements and genes putatively acquired by horizontal gene transfer. This explains the paradox of the high pseudogene load in the small Rickettsia genomes. This study demonstrates that GenComp provides an efficient system for pseudogene identification and may help distinguish genes from spurious ORFs in the many pan-genome sequencing projects going on worldwide.

  KLICKA HÄR FÖR ATT SE AVHANDLINGEN I FULLTEXT. (PDF-format)