TY - JOUR
T1 - Reliability Tradeoffs in Design of Volatile and Nonvolatile Caches
AU - Mittal, Sparsh
AU - Vetter, Jeffrey S.
N1 - Publisher Copyright:
© 2016 World Scientific Publishing Company.
PY - 2016/11/1
Y1 - 2016/11/1
N2 - Researchers have explored both volatile memories (e.g., SRAM and embedded DRAM) and nonvolatile memories (NVMs, such as resistive RAM) for design of on-chip caches. However, both volatile and nonvolatile memories present unique reliability challenges. NVMs are immune to radiation-induced soft errors, however, due to their limited write endurance, they are vulnerable to hard errors under nonuniform write distribution. By contrast, SRAM has high write endurance but is susceptible to soft errors due to cosmic radiation. SRAM-NVM hybrid caches and the management techniques for them aim to bring the best of SRAM and NVM together, however, the reliability implications of them have not been well understood. In this paper, we show that there are inherent tradeoffs in improving resilience to hard and soft errors in hybrid caches such that mitigating one may result in aggravating another. We confirm this by experiments with two recent hybrid cache management techniques. We also re-examine cache design trends in modern processors from reliability perspective. This paper provides valuable insights to system developers for making reliability-aware design decisions.
AB - Researchers have explored both volatile memories (e.g., SRAM and embedded DRAM) and nonvolatile memories (NVMs, such as resistive RAM) for design of on-chip caches. However, both volatile and nonvolatile memories present unique reliability challenges. NVMs are immune to radiation-induced soft errors, however, due to their limited write endurance, they are vulnerable to hard errors under nonuniform write distribution. By contrast, SRAM has high write endurance but is susceptible to soft errors due to cosmic radiation. SRAM-NVM hybrid caches and the management techniques for them aim to bring the best of SRAM and NVM together, however, the reliability implications of them have not been well understood. In this paper, we show that there are inherent tradeoffs in improving resilience to hard and soft errors in hybrid caches such that mitigating one may result in aggravating another. We confirm this by experiments with two recent hybrid cache management techniques. We also re-examine cache design trends in modern processors from reliability perspective. This paper provides valuable insights to system developers for making reliability-aware design decisions.
KW - Nonvolatile memory (NVM)
KW - last level cache (LLC)
KW - reliability
KW - resistive RAM (ReRAM)
KW - soft-error resilience
KW - spin transfer torque RAM (STT-RAM)
KW - write endurance
UR - http://www.scopus.com/inward/record.url?scp=84976417027&partnerID=8YFLogxK
U2 - 10.1142/S0218126616501395
DO - 10.1142/S0218126616501395
M3 - Article
AN - SCOPUS:84976417027
SN - 0218-1266
VL - 25
JO - Journal of Circuits, Systems and Computers
JF - Journal of Circuits, Systems and Computers
IS - 11
M1 - 1650139
ER -