Default Cover Image

2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT)

Sept. 19 2012 to Sept. 23 2012

Minneapolis, MN, USA

Table of Contents

[Front matter]Freely available from IEEE.pp. c1-c1
The changing role of supercomputingFreely available from IEEE.pp. 1-1
PGCapping: Exploiting power gating for power capping and core lifetime balancing in CMPsFull-text access may be available. Sign in or learn about subscription options.pp. 13-22
Power-efficient time-sensitive mapping in heterogeneous systemsFull-text access may be available. Sign in or learn about subscription options.pp. 23-32
Riposte: A trace-driven compiler and parallel VM for vector code in RFull-text access may be available. Sign in or learn about subscription options.pp. 43-51
Auto-parallelizing stateful distributed streaming applicationsFull-text access may be available. Sign in or learn about subscription options.pp. 53-64
APCR: An adaptive physical channel regulator for On-Chip InterconnectsFull-text access may be available. Sign in or learn about subscription options.pp. 87-96
Pointy: A hybrid pointer prefetcher for managed runtime systemsFull-text access may be available. Sign in or learn about subscription options.pp. 97-106
Scalability-based manycore partitioningFull-text access may be available. Sign in or learn about subscription options.pp. 107-116
Enhancing performance optimization of multicore chips and multichip nodes with data structure metricsFull-text access may be available. Sign in or learn about subscription options.pp. 147-156
Compiling to avoid communicationFull-text access may be available. Sign in or learn about subscription options.pp. 157-157
Visualizing transactional memoryFull-text access may be available. Sign in or learn about subscription options.pp. 159-170
Sandboxing transactional memoryFull-text access may be available. Sign in or learn about subscription options.pp. 171-179
RISE: Improving the streaming processors reliability against soft errors in GPGPUsFull-text access may be available. Sign in or learn about subscription options.pp. 191-200
Top500 versus sustained performance - the top problems with the TOP500 list - and what to do about themFull-text access may be available. Sign in or learn about subscription options.pp. 223-230
Practically Private: Enabling high performance CMPs through compiler-assisted data classificationFull-text access may be available. Sign in or learn about subscription options.pp. 231-240
Complexity-effective multicore coherenceFull-text access may be available. Sign in or learn about subscription options.pp. 241-251
HaLock: Hardware-assisted lock contention detection in multithreaded applicationsFull-text access may be available. Sign in or learn about subscription options.pp. 253-262
Runtime detection and optimization of collective communication patternsFull-text access may be available. Sign in or learn about subscription options.pp. 263-272
Coalition Threading: Combining traditional and non-traditional parallelism to maximize scalabilityFull-text access may be available. Sign in or learn about subscription options.pp. 273-282
Shared memory multiplexing: A novel way to improve GPGPU throughputFull-text access may be available. Sign in or learn about subscription options.pp. 283-292
Efficient techniques for predicting cache sharing and throughputFull-text access may be available. Sign in or learn about subscription options.pp. 305-314
Optimal bypass monitor for high performance last-level cachesFull-text access may be available. Sign in or learn about subscription options.pp. 315-324
Lossless and lossy memory I/O link compression for improving performance of GPGPU workloadsFull-text access may be available. Sign in or learn about subscription options.pp. 325-334
Multi2Sim: A simulation framework for CPU-GPU computingFull-text access may be available. Sign in or learn about subscription options.pp. 335-344
A yoke of oxen and a thousand chickens for heavy lifting graph processingFull-text access may be available. Sign in or learn about subscription options.pp. 345-354
The evicted-address filter: A unified mechanism to address both cache pollution and thrashingFull-text access may be available. Sign in or learn about subscription options.pp. 355-366
Hardware acceleration in the IBM PowerEN processor: architecture and performanceFull-text access may be available. Sign in or learn about subscription options.pp. 389-399
Workload and power budget partitioning for single-chip heterogeneous processorsFull-text access may be available. Sign in or learn about subscription options.pp. 401-410
Database analytics acceleration using FPGAsFull-text access may be available. Sign in or learn about subscription options.pp. 411-420
Acceleration of bulk memory operations in a heterogeneous multicore architectureFull-text access may be available. Sign in or learn about subscription options.pp. 423-424
Integrating nanophotonics in GPU microarchitectureFull-text access may be available. Sign in or learn about subscription options.pp. 425-426
Branch and data herding: Reducing control and memory divergence for error-tolerant GPU applicationsFull-text access may be available. Sign in or learn about subscription options.pp. 427-428
Layout-oblivious optimization for matrix computationsFull-text access may be available. Sign in or learn about subscription options.pp. 429-430
Boost.SIMD: Generic programming for portable SIMDizationFull-text access may be available. Sign in or learn about subscription options.pp. 431-432
Speculative parallelization needs rigorFull-text access may be available. Sign in or learn about subscription options.pp. 433-434
Supporting stateful tasks in a dataflow graphFull-text access may be available. Sign in or learn about subscription options.pp. 435-436
MaSiF: Machine learning guided auto-tuning of Parallel SkeletonsFull-text access may be available. Sign in or learn about subscription options.pp. 437-438
TMNOC: A case of HTM and NoC Co-design for increased energy efficiency and concurrencyFull-text access may be available. Sign in or learn about subscription options.pp. 439-440
ReCaP: A Region-Based cure for the common cold cacheFull-text access may be available. Sign in or learn about subscription options.pp. 443-444
Power-efficient computing for compute-intensive GPGPU applicationsFull-text access may be available. Sign in or learn about subscription options.pp. 445-446
Off-chip access localization for NoC-based multicoresFull-text access may be available. Sign in or learn about subscription options.pp. 447-448
Many-thread aware instruction-level parallelism: Architecting shader cores for GPU computingFull-text access may be available. Sign in or learn about subscription options.pp. 449-450
PS-Dir: A scalable two-level directory cacheFull-text access may be available. Sign in or learn about subscription options.pp. 451-452
Bandwidth Bandit: Quantitative characterization of memory contentionFull-text access may be available. Sign in or learn about subscription options.pp. 457-458
Speculative dynamic vectorization for HW/SW codesigned processorsFull-text access may be available. Sign in or learn about subscription options.pp. 459-460
Fine-grained parallel traversals of irregular data structuresFull-text access may be available. Sign in or learn about subscription options.pp. 461-462
High-performance analysis of filtered semantic graphsFull-text access may be available. Sign in or learn about subscription options.pp. 463-464
Energy-efficient cache partitioning for future CMPsFull-text access may be available. Sign in or learn about subscription options.pp. 465-466
A low-overhead dynamic optimization framework for multicoresFull-text access may be available. Sign in or learn about subscription options.pp. 467-468
Making it practical and effective: Fast and precise May-Happen-in-Parallel analysisFull-text access may be available. Sign in or learn about subscription options.pp. 469-470
Mileage-based Contention Management in Transactional MemoryFull-text access may be available. Sign in or learn about subscription options.pp. 471-471
System-level power-performance efficiency modeling for emergent GPU architecturesFull-text access may be available. Sign in or learn about subscription options.pp. 473-473
Transactional event profiling in a best-effort hardware transactional memory systemFull-text access may be available. Sign in or learn about subscription options.pp. 475-475
Transparent runtime deadlock eliminationFull-text access may be available. Sign in or learn about subscription options.pp. 477-477
Design of a storage processing unitFull-text access may be available. Sign in or learn about subscription options.pp. 479-479
SkipCache: Miss-rate aware cache managementFull-text access may be available. Sign in or learn about subscription options.pp. 481-481
Using combined profiling to decide when Thread Level Speculation is profitableFull-text access may be available. Sign in or learn about subscription options.pp. 483-483
Hardware prefetchers for emerging parallel applicationsFull-text access may be available. Sign in or learn about subscription options.pp. 485-485
Strategies based on green policies to the grid resource allocationFull-text access may be available. Sign in or learn about subscription options.pp. 487-487
Linearly compressed pages: A main memory compression framework with low complexity and low latencyFull-text access may be available. Sign in or learn about subscription options.pp. 489-489
Energy-efficient workload mapping in heterogeneous systems with multiple types of resourcesFull-text access may be available. Sign in or learn about subscription options.pp. 491-491
Phase-based scheduling and thread migration for heterogeneous multicore processorsFull-text access may be available. Sign in or learn about subscription options.pp. 493-493
Author indexFreely available from IEEE.pp. 495-496
Showing 81 out of 81