Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Feb 7;18(2):e0278346.
doi: 10.1371/journal.pone.0278346. eCollection 2023.

L2C2: Last-level compressed-contents non-volatile cache and a procedure to forecast performance and lifetime

Affiliations

L2C2: Last-level compressed-contents non-volatile cache and a procedure to forecast performance and lifetime

Carlos Escuin et al. PLoS One. .

Abstract

Several emerging non-volatile (NV) memory technologies are rising as interesting alternatives to build the Last-Level Cache (LLC). Their advantages, compared to SRAM memory, are higher density and lower static power, but write operations wear out the bitcells to the point of eventually losing their storage capacity. In this context, this paper presents a novel LLC organization designed to extend the lifetime of the NV data array and a procedure to forecast in detail the capacity and performance of such an NV-LLC over its lifetime. From a methodological point of view, although different approaches are used in the literature to analyze the degradation of an NV-LLC, none of them allows to study in detail its temporal evolution. In this sense, this work proposes a forecasting procedure that combines detailed simulation and prediction, allowing an accurate analysis of the impact of different cache control policies and mechanisms (replacement, wear-leveling, compression, etc.) on the temporal evolution of the indices of interest, such as the effective capacity of the NV-LLC or the system IPC. We also introduce L2C2, a LLC design intended for implementation in NV memory technology that combines fault tolerance, compression, and internal write wear leveling for the first time. Compression is not used to store more blocks and increase the hit rate, but to reduce the write rate and increase the lifetime during which the cache supports near-peak performance. In addition, to support byte loss without performance drop, L2C2 inherently allows N redundant bytes to be added to each cache entry. Thus, L2C2+N, the endurance-scaled version of L2C2, allows balancing the cost of redundant capacity with the benefit of longer lifetime. For instance, as a use case, we have implemented the L2C2 cache with STT-RAM technology. It has affordable hardware overheads compared to that of a baseline NV-LLC without compression in terms of area, latency and energy consumption, and increases up to 6-37 times the time in which 50% of the effective capacity is degraded, depending on the variability in the manufacturing process. Compared to L2C2, L2C2+6 which adds 6 bytes of redundant capacity per entry, that means 9.1% of storage overhead, can increase up to 1.4-4.3 times the time in which the system gets its initial peak performance degraded.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Average write rate per frame in sets with A live frames as a function of capacity (90%, 75%, and 50%).
Fig 2
Fig 2. Block flow diagram of non-inclusive model.
Fig 3
Fig 3. Block classification regarding its compression ratio for the selected SPEC CPU 2006 and 2017 applications.
Fig 4
Fig 4. Layout of a frame entry in the SRAM tag and NVM data arrays.
Fig 5
Fig 5. Flow of writing a block in L2C2 and components involved.
Fig 6
Fig 6. Flow of reading a block in L2C2 and components involved.
Fig 7
Fig 7. Example of ECB rearranging to write a 9-byte frame.
Fig 8
Fig 8. Per-byte Remaining Writes (A) and Write Rate (B) maps.
Fig 9
Fig 9. Forecasting procedure diagram.
Basic procedure in black, approximations in blue.
Fig 10
Fig 10. Forecasted T50C (in years) as a function of the number of epochs for frame disabling (A) and L2C2 (B) caches.
Three coefficients of variation are employed: cv = 0.2, 0.25, and 0.3.
Fig 11
Fig 11. Effective capacity evolution over time until 50% of capacity is lost for three different cv.
cv = 0.2 (A), cv = 0.25 (B), cv = 0.3 (C).
Fig 12
Fig 12. Normalized IPC evolution over time until 50% of capacity is lost for three different cv.
cv = 0.2 (A), cv = 0.25 (B), cv = 0.3 (C).
Fig 13
Fig 13. IPC evolution until losing 50% of capacity of an L2C2 without intra-frame wear-leveling mechanism, L2C2-NWL, for cv = 0.2.
Fig 14
Fig 14. IPC evolution until losing 50% of capacity of an L2C2 with LRU-Best-Fit replacement policy, L2C2-BF, for cv = 0.2.
Fig 15
Fig 15. IPC evolution until losing 50% of capacity of FD and L2C2 for cv = 0.2.
Doubling cache size (A), doubling the number of cores while keeping the same 4MB/core (B), and only considering the most memory-intensive programs (C).

References

    1. Sakhare S, Perumkunnil M, Bao TH, Rao S, Kim W, Crotti D, et al. Enablement of STT-MRAM as last level cache for the high performance computing domain at the 5nm node. In: 2018 IEEE Int. Electron Devices Meeting (IEDM); 2018. p. 18.3.1–18.3.4.
    1. Lee BC, Ipek E, Mutlu O, Burger D. Architecting phase change memory as a scalable dram alternative. In: Proc. of the 36th annual Int. Symp. on Computer architecture; 2009. p. 2–13.
    1. Qureshi MK, Gurumurthi S, Rajendran B. Phase change memory: From devices to systems. Synthesis Lectures on Computer Architecture. 2011;6(4):1–134. doi: 10.1007/978-3-031-01735-3 - DOI
    1. Joo Y, Niu D, Dong X, Sun G, Chang N, Xie Y. Energy-and endurance-aware design of phase change memory caches. In: 2010 Design, Automation & Test in Europe Conf. & Exhibition (DATE 2010). IEEE; 2010. p. 136–141.
    1. Apalkov D, Khvalkovskiy A, Watts S, Nikitin V, Tang X, Lottis D, et al.. Spin-transfer torque magnetic random access memory (STT-MRAM). ACM Journal on Emerging Technologies in Computing Systems (JETC). 2013;9(2):1–35. doi: 10.1145/2463585.2463589 - DOI

Publication types