Abstract
This paper shows, both mathematically and experimentally, that inline deduplication is unsuitable, in terms of performance, for file systems on ultra-low-latency Intel Optane DC PM devices, and proposes DeNova, an offline deduplication scheme designed specifically for log-structured NVM file systems such as NOVA. DeNova offers high-performance, low-latency I/O processing and executes deduplication in the background without interfering with foreground I/O. DeNova employs DRAM-free persistent deduplication metadata that favors CPU cache lines, and ensures failure consistency on any system failure. We implement DeNova in the NOVA file system. Our evaluation shows that DeNova incurs a negligible performance drop of less than 1% relative to baseline NOVA while achieving high storage space savings. Extensive experiments show that DeNova is failure consistent in all failure scenarios.
Original language | English |
---|---|
Title of host publication | Proceedings - 2022 IEEE 36th International Parallel and Distributed Processing Symposium, IPDPS 2022 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 1360-1371 |
Number of pages | 12 |
ISBN (Electronic) | 9781665481069 |
State | Published - 2022 |
Event | 36th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2022 - Virtual, Online, France. Duration: May 30 2022 → Jun 3 2022 |
Publication series
Name | Proceedings - 2022 IEEE 36th International Parallel and Distributed Processing Symposium, IPDPS 2022 |
---|---|
Conference
Conference | 36th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2022 |
---|---|
Country/Territory | France |
City | Virtual, Online |
Period | 05/30/22 → 06/03/22 |
Funding
We thank the reviewers for their constructive comments, which helped improve the paper. This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2021R1A2C2014386). This work was also supported by, and used the resources of, the Oak Ridge Leadership Computing Facility, located in the National Center for Computational Sciences at ORNL, which is managed by UT-Battelle, LLC for the U.S. DOE (under contract No. DE-AC05-00OR22725).
Keywords
- Consistency
- Deduplication
- File System
- Non-Volatile Memory