Dynamic Cache Clustering for Chip Multiprocessors

Mohammad H. Hammoud, Sangyeun Cho, and Rami Melhem.

Proceedings of the 23rd Int'l Conference on Supercomputing (ICS), IBM T. J. Watson Research Center, NY, June 2009.

Abstract:

This paper proposes DCC (Dynamic Cache Clustering), a novel distributed cache management scheme for large-scale chip multiprocessors. Using DCC, a per-core cache cluster is comprised of a number of L2 cache banks and cache clusters are constructed, expanded, and contracted dynamically to match each core's cache demand. The basic tradeoffs of varying the on-chip cache clusters are Average L2 Access Latency (AAL) and L2 Miss Rate (MR). DCC uniquely and efficiently optimizes both the metrics and continuously tracks a near-optimal cache organization from many possible configurations. Simulation results using a full-system simulator demonstrate that DCC outperforms alternative L2 cache designs.