2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA)
Download PDF

Abstract

We propose an ISA extension that decouples the data access and register write operations in a load instruction. We describe system and hardware support for decoupled loads. Furthermore, we show how compilers can generate better static instruction schedules by hoisting a decoupled load's data access above may-alias stores and branches. We find that decoupled loads improve performance with geometric mean speedups of 8.4%.
Like what you’re reading?
Already a member?
Get this article FREE with a new membership!

Related Articles