Massively parallel analysis of cells and nucleic acids

Detta är en avhandling från Stockholm : KTH Royal Institute of Technology

Sammanfattning: Recent proceedings in biotechnology have enabled completely new avenues in life science research to be explored. By allowing increased parallelization an ever-increasing complexity of cell samples or experiments can be investigated in shorter time and at a lower cost. This facilitates for example large-scale efforts to study cell heterogeneity at the single cell level, by analyzing cells in parallel that also can include global genomic analyses. The work presented in this thesis focuses on massively parallel analysis of cells or nucleic acid samples, demonstrating technology developments in the field as well as use of the technology in life sciences.In stem cell research issues such as cell morphology, cell differentiation and effects of reprogramming factors are frequently studied, and to obtain information on cell heterogeneity these experiments are preferably carried out on single cells. In paper I we used a high-density microwell device in silicon and glass for culturing and screening of stem cells. Maintained pluripotency in stem cells from human and mouse was demonstrated in a screening assay by antibody staining and the chip was furthermore used for studying neural differentiation. The chip format allows for low sample volumes and rapid high-throughput analysis of single cells, and is compatible with Fluorescence Activated Cell Sorting (FACS) for precise cell selection.Massively parallel DNA sequencing is revolutionizing genomics research throughout the life sciences by constantly producing increasing amounts of data from one sequencing run. However, the reagent costs and labor requirements in current massively parallel sequencing protocols are still substantial. In paper II-IV we have focused on flow-sorting techniques for improved sample preparation in bead-based massive sequencing platforms, with the aim of increasing the amount of quality data output, as demonstrated on the Roche/454 platform. In paper II we demonstrate a rapid alternative to the existing shotgun sample titration protocol and also use flow-sorting to enrich for beads that carry amplified template DNA after emulsion PCR, thus obtaining pure samples and with no downstream sacrifice of DNA sequencing quality. This should be seen in comparison to the standard 454-enrichment protocol, which gives rise to varying degrees of sample purity, thus affecting the sequence data output of the sequencing run. Massively parallel sequencing is also useful for deep sequencing of specific PCR-amplified targets in parallel. However, unspecific product formation is a common problem in amplicon sequencing and since these shorter products may be difficult to fully remove by standard procedures such as gel purification, and their presence inevitably reduces the number of target sequence reads that can be obtained in each sequencing run. In paper III a gene-specific fluorescent probe was used for target-specific FACS enrichment to specifically enrich for beads with an amplified target gene on the surface. Through this procedure a nearly three-fold increase in fraction of informative sequences was obtained and with no sequence bias introduced. Barcode labeling of different DNA libraries prior to pooling and emulsion PCR is standard procedure to maximize the number of experiments that can be run in one sequencing lane, while also decreasing the impact of technical noise. However, variation between libraries in quality and GC content affects amplification efficiency, which may result in biased fractions of the different libraries in the sequencing data. In paper IV barcode specific labeling and flow-sorting for normalization of beads with different barcodes on the surface was used in order to weigh the proportion of data obtained from different samples, while also removing mixed beads, and beads with no or poorly amplified product on the surface, hence also resulting in an increased sequence quality.In paper V, cell heterogeneity within a human being is being investigated by low-coverage whole genome sequencing of single cell material. By focusing on the most variable portion of the human genome, polyguanine nucleotide repeat regions, variability between different cells is investigated and highly variable polyguanine repeat loci are identified. By selectively amplifying and sequencing polyguanine nucleotide repeats from single cells for which the phylogenetic relationship is known, we demonstrate that massively parallel sequencing can be used to study cell-cell variation in length of these repeats, based on which a phylogenetic tree can be drawn.

  KLICKA HÄR FÖR ATT SE AVHANDLINGEN I FULLTEXT. (PDF-format)