Exploiting Fine-Grained Pipeline Parallelism for Wavefront Computations on Multicore Platforms

Guiming Wu; Miao Wang; Yong Dou; Fei Xia

doi:10.1109/ICPPW.2009.15

2009 International Conference on Parallel Processing Workshops

Exploiting Fine-Grained Pipeline Parallelism for Wavefront Computations on Multicore Platforms

Year: 2009, Pages: 402-408

DOI Bookmark: 10.1109/ICPPW.2009.15

Authors

Guiming Wu
Miao Wang
Yong Dou
Fei Xia

Abstract

This paper presents our experience with exploiting fine-grained pipeline parallelism for wavefront computations on a multicore platform. Wavefront computations have been widely applied in many application areas such as scientific computing algorithms and dynamic programming algorithms. To exploit fine-grained parallelism on multicore platforms, the programmers must consider the problems of synchronization, scheduling strategies and data locality. This paper shows the impact of fine-grained synchronization methods, scheduling strategies and data tile sizes on performance. We propose a low cost, lock-free, and lightweight synchronization method that can fully exploit pipeline parallelism. Our evaluation shows that RNAfold, an application for RNA secondary structures prediction, can achieve the best speedup of 3.88 on four cores under our framework.

Like what you’re reading?

Already a member?

Get this article FREE with a new membership!

A Synthesis System For Bus-Based Wavefront Array Architectures
Proceedings of International Conference on Application Specific Systems, Architectures and Processors: ASAP '96
Study on Fine-Grained Synchronization in Many-Core Architecture
2009 10th ACIS International Conference on Software Engineering, Artificial Intelligences, Networking and Parallel/Distributed Computing
Efficient Temporal Blocking for Stencil Computations by Multicore-Aware Wavefront Parallelization
2009 33rd Annual IEEE International Computer Software and Applications Conference
(R) Scheduling of Wavefront Parallelism on Scalable Shared-memory Multiprocessors
Proceedings of the 1996 ICPP Workshop on Challenges for Parallel Processing
Exploring Multi-Grained Parallelism in Compute-Intensive DEVS Simulations
2010 IEEE Workshop on Principles of Advanced and Distributed Simulation
Out-of-Core Wavefront Computations with Reduced Synchronization
2008 16th Euromicro Conference on Parallel, Distributed and Network-based Processing - PDP '08
Insertion Tree Phasers: Efficient and Scalable Barrier Synchronization for Fine-Grained Parallelism
High Performance Computing and Communication & IEEE International Conference on Embedded Software and Systems, IEEE International Conference on
Exploiting Wavefront Parallelism on Large-Scale Shared-Memory Multiprocessors
IEEE Transactions on Parallel & Distributed Systems
Wavefront Diffusion and LMSR: Algorithms for Dynamic Repartitioning of Adaptive Meshes
IEEE Transactions on Parallel & Distributed Systems
Implementation of Production Systems on Message-Passing Computers
IEEE Transactions on Parallel & Distributed Systems

Exploiting Fine-Grained Pipeline Parallelism for Wavefront Computations on Multicore Platforms

Authors

Abstract

Related Articles