Implementing efficient data compression and encryption in a persistent key-value store for HPC

Jungwon Kim, Jeffrey S. Vetter

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Recently, persistent data structures, like key-value stores (KVSs), which are stored in a high-performance computing (HPC) system’s nonvolatile memory, provide an attractive solution for a number of emerging challenges like limited I/O performance. Data compression and encryption are two well-known techniques for improving several properties of such data-oriented systems. This article investigates how to efficiently integrate data compression and encryption into persistent KVSs for HPC with the ultimate goal of hiding their costs and complexity in terms of performance and ease of use. Our compression technique exploits deep memory hierarchy in an HPC system to achieve both storage reduction and performance improvement. Our encryption technique provides a practical level of security and enables sharing of sensitive data securely in complex scientific workflows with nearly imperceptible cost. We implement the proposed techniques on top of a distributed embedded KVS to evaluate the benefits and costs of incorporating these capabilities along different points in the dataflow path, illustrating differences in effective bandwidth, latency, and additional computational expense on Swiss National Supercomputing Centre’s Grand Tavé and National Energy Research Scientific Computing Center’s Cori.

Original languageEnglish
Pages (from-to)1098-1112
Number of pages15
JournalInternational Journal of High Performance Computing Applications
Volume33
Issue number6
DOIs
StatePublished - Nov 1 2019

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by a grant from the CSCS under the Accelerated Data Analytics and Computing collaboration. This research used resources of the NERSC, a US Department of Energy Office of Science User Facility operated under contract no DE-AC02-05CH11231. This research was also supported by the Exascale Computing Project (17-SC-20-SC), a collaborative effort of the US Department of Energy Office of Science and the National Nuclear Security Administration. This material is based upon work supported by the US Department of Energy, Office of Science, Office of Advanced Scientific Computing Research under contract number DE-AC05-00OR22725.

Keywords

  • High-performance computing
  • distributed systems
  • nonvolatile memory
  • persistent memory
  • programming systems

Fingerprint

Dive into the research topics of 'Implementing efficient data compression and encryption in a persistent key-value store for HPC'. Together they form a unique fingerprint.

Cite this