Real-Time GPU Computing: Cache or No Cache?

Yijie Huangfu; Wei Zhang

doi:10.1109/ISORC.2015.12

2015 IEEE 18th International Symposium on Real-Time Distributed Computing (ISORC)

Year: 2015, Pages: 182-189

Abstract

Recent Graphics Processing Units (GPUs) employ cache memories to boost performance. However, cache memories are well known to harm time predictability on CPUs. For high-performance real-time systems using GPUs, it remains unknown whether cache memories should be employed. In this paper, we quantitatively compare the performance of GPUs with and without caches, and find that GPUs without caches actually achieve better average-case performance, with higher time predictability. However, we also study a profiling-based cache bypassing method, which can use the L1 data cache more efficiently to achieve better average-case performance than running without the cache. Therefore, it still seems beneficial to employ caches for real-time computing on GPUs.
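
The abstract does not spell out how the profiling-based bypassing decision is made, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical per-load profile (`LoadProfile`) collected in an offline run and a hypothetical hit-rate threshold (`min_hit_rate`); loads whose profiled L1 hit rate falls below the threshold are marked to bypass the L1 data cache.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoadProfile:
    """Profiled L1 behavior of one static load instruction (hypothetical format)."""
    pc: int        # program counter identifying the load instruction
    accesses: int  # L1 data cache accesses observed while profiling
    hits: int      # L1 data cache hits observed while profiling


def select_bypass_loads(profiles, min_hit_rate=0.2):
    """Return the PCs of loads that should bypass the L1 data cache.

    Assumption for illustration: a load is bypassed when its profiled
    hit rate is below min_hit_rate. Loads that never executed during
    profiling stay cached, since the profile gives no evidence for them.
    """
    bypass = set()
    for p in profiles:
        if p.accesses == 0:
            continue  # no profile data, keep the default (cached) behavior
        if p.hits / p.accesses < min_hit_rate:
            bypass.add(p.pc)
    return bypass


# Made-up profile data, purely to exercise the function.
profiles = [
    LoadProfile(pc=0x10, accesses=1000, hits=900),  # reuses data well
    LoadProfile(pc=0x18, accesses=1000, hits=50),   # mostly misses
    LoadProfile(pc=0x20, accesses=0, hits=0),       # never executed
]
bypass_pcs = select_bypass_loads(profiles)
```

With this made-up data only the load at `0x18` is selected. On a real GPU the selected loads would then have to be compiled or rewritten to skip the L1 cache, a step this sketch leaves out.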
