Default Cover Image

Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques

Sept. 7 2013 to Sept. 11 2013

Edinburgh, United Kingdom

Table of Contents

CopyrightFreely available from IEEE.pp. ii-ii
General chairs' welcome messageFreely available from IEEE.pp. iii-iii
Coordinated power-performance optimization in manycoresFull-text access may be available. Sign in or learn about subscription options.pp. 51-62
An opportunistic prediction-based thread scheduling to maximize throughput/watt in AMPsFull-text access may be available. Sign in or learn about subscription options.pp. 63-72
APOGEE: adaptive prefetching on GPUs for energy efficiencyFull-text access may be available. Sign in or learn about subscription options.pp. 73-82
Parallel frame rendering: trading responsiveness for energy on a mobile GPUFull-text access may be available. Sign in or learn about subscription options.pp. 83-92
Exploring hybrid memory for GPU energy efficiency through software-hardware co-designFull-text access may be available. Sign in or learn about subscription options.pp. 93-102
S-CAVE: effective SSD caching to improve virtual machine storage performanceFull-text access may be available. Sign in or learn about subscription options.pp. 103-112
L1-bandwidth aware thread allocation in multicore SMT processorsFull-text access may be available. Sign in or learn about subscription options.pp. 123-132
Memory-centric system interconnect design with hybrid memory cubesFull-text access may be available. Sign in or learn about subscription options.pp. 145-156
Neither more nor less: optimizing thread-level parallelism for GPGPUsFull-text access may be available. Sign in or learn about subscription options.pp. 157-166
Exploring hybrid memory for GPU energy efficiency through software-hardware co-designFull-text access may be available. Sign in or learn about subscription options.pp. 93-102
Fairness-aware scheduling on single-ISA heterogeneous multi-coresFull-text access may be available. Sign in or learn about subscription options.pp. 177-188
DANBI: dynamic scheduling of irregular stream programs for many-core systemsFull-text access may be available. Sign in or learn about subscription options.pp. 189-200
An empirical model for predicting cross-core performance interference on multicore processorsFull-text access may be available. Sign in or learn about subscription options.pp. 201-212
Jigsaw: scalable software-defined cachesFull-text access may be available. Sign in or learn about subscription options.pp. 213-224
Managing shared last-level cache in a heterogeneous multicore processorFull-text access may be available. Sign in or learn about subscription options.pp. 225-234
Reshaping cache misses to improve row-buffer locality in multicore systemsFull-text access may be available. Sign in or learn about subscription options.pp. 235-244
Transparent CPU-GPU collaboration for data-parallel kernels on heterogeneous systemsFull-text access may be available. Sign in or learn about subscription options.pp. 245-256
RSVM: a region-based software virtual memory for GPUFull-text access may be available. Sign in or learn about subscription options.pp. 269-278
Meeting midway: improving CMP performance with memory-side prefetchingFull-text access may be available. Sign in or learn about subscription options.pp. 289-298
Building expressive, area-efficient coherence directoriesFull-text access may be available. Sign in or learn about subscription options.pp. 299-308
McRouter: multicast within a router for high performance network-on-chipsFull-text access may be available. Sign in or learn about subscription options.pp. 319-330
Concurrent predicates: a debugging technique for every parallel programmerFull-text access may be available. Sign in or learn about subscription options.pp. 331-340
Breaking SIMD shackles with an exposed flexible microarchitecture and the access execute PDGFull-text access may be available. Sign in or learn about subscription options.pp. 341-352
Vectorization past dependent branches through speculationFull-text access may be available. Sign in or learn about subscription options.pp. 353-362
Automatic vectorization of tree traversalsFull-text access may be available. Sign in or learn about subscription options.pp. 363-374
Generating efficient data movement code for heterogeneous architectures with distributed-memoryFull-text access may be available. Sign in or learn about subscription options.pp. 375-386
Automatic OpenCL work-group size selection for multicore CPUsFull-text access may be available. Sign in or learn about subscription options.pp. 387-398
McRouter: Multicast within a router for high performance network-on-chipsFull-text access may be available. Sign in or learn about subscription options.pp. 319-329
Task sampling: computer architecture simulation in the many-core eraFull-text access may be available. Sign in or learn about subscription options.pp. 405-406
PS-cache: an energy-efficient cache design for chip multiprocessorsFull-text access may be available. Sign in or learn about subscription options.pp. 407-408
Dynamic memory access monitoring based on tagged memoryFull-text access may be available. Sign in or learn about subscription options.pp. 409-410
Exposing ILP in custom hardware with a dataflow compiler IRFull-text access may be available. Sign in or learn about subscription options.pp. 411-412
TCPT - Thread criticality-driven prefetcher throttlingFull-text access may be available. Sign in or learn about subscription options.pp. 399-399
Do inputs matter? using data-dependence profiling to evaluate thread level speculation in BG/QFull-text access may be available. Sign in or learn about subscription options.pp. 401-401
Can lock-free and combining techniques co-exist? A novel approach on concurrent queueFull-text access may be available. Sign in or learn about subscription options.pp. 403-403
Task sampling: Computer architecture simulation in the many-core eraFull-text access may be available. Sign in or learn about subscription options.pp. 405-405
PS-cache: An energy-efficient cache design for chip multiprocessorsFull-text access may be available. Sign in or learn about subscription options.pp. 407-407
Dynamic memory access monitoring based on tagged memoryFull-text access may be available. Sign in or learn about subscription options.pp. 409-409
Exposing ILP in custom hardware with a dataflow compiler IRFull-text access may be available. Sign in or learn about subscription options.pp. 411-411
Author indexFreely available from IEEE.pp. 412-412
Showing 52 out of 52