Avancerad sökning

Visar resultat 1 - 5 av 14 avhandlingar som matchar ovanstående sökkriterier.

  1. 1. An Automated and Controlled Numerical Precision Reduction Framework for GPUs

    Författare :Alexandra Angerd; Chalmers tekniska högskola; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; NATURVETENSKAP; NATURAL SCIENCES; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Microarchitecture; Floating-Point Precision; Approximate Computing; Register File; GPU;

    Sammanfattning : Reducing the precision of floating-point values is an effective approach to achieve higher performance as well as higher energy-efficiency. This is especially true for GPUs, since many of its common tasks are inherently insensitive to precision-reduction. LÄS MER

  2. 2. Approximation and Compression Techniques to Enhance Performance of Graphics Processing Units

    Författare :Alexandra Angerd; Chalmers tekniska högskola; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; TEKNIK OCH TEKNOLOGIER; ENGINEERING AND TECHNOLOGY; Compression; Approximate Computing; Register File; Machine Learning; Floating-Point Precision; Microarchitecture; GPU;

    Sammanfattning : A key challenge in modern computing systems is to access data fast enough to fully utilize the computing elements in the chip. In Graphics Processing Units (GPUs), the performance is often constrained by register file size, memory bandwidth, and the capacity of the main memory. LÄS MER

  3. 3. Techniques to Cancel Execution Early to Improve Processor Efficiency

    Författare :Mafijul Islam; Chalmers tekniska högskola; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; processor design; energy-efficiency; narrow-width cache; instruction reuse; zero-value cache; resource-efficient; narrow-width load; complexity-effective; small value locality; register file cache; frequent value locality; trivial instruction; silent load; high-performance; zero load;

    Sammanfattning : The evolution of computer systems to continuously improve execution efficiency has traditionally embraced various approaches across microprocessor generations. Unfortunately, contemporary processors still suffer from several inefficiencies although they offer an unprecedented level of computing capabilities. LÄS MER

  4. 4. Integrated Software Pipelining

    Författare :Mattias Eriksson; Christoph Kessler; Sid Ahmed Ali Touati; Linköpings universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Code generation; compilers; instruction scheduling; register allocation; spill code generation; modulo scheduling; integer linear programming; genetic programming.; Computer science; Datavetenskap;

    Sammanfattning : In this thesis we address the problem of integrated software pipelining for clustered VLIW architectures. The phases that are integrated and solved as one combined problem are: cluster assignment, instruction selection, scheduling, register allocation and spilling. LÄS MER

  5. 5. Leveraging Existing Microarchitectural Structures to Improve First-Level Caching Efficiency

    Författare :Ricardo Alves; David Black-Schaffer; Stefanos Kaxiras; Mattan Erez; Uppsala universitet; []
    Nyckelord :NATURVETENSKAP; NATURAL SCIENCES; Energy Efficient Caching; Memory Architecture; Single Thread Performance; First-Level Caching; Out-of-Order Pipelines; Instruction Scheduling; Filter-Cache; Way-Prediction; Value-Prediction; Register-Sharing.;

    Sammanfattning : Low-latency data access is essential for performance. To achieve this, processors use fast first-level caches combined with out-of-order execution, to decrease and hide memory access latency respectively. LÄS MER