Efficient object storage journaling in a distributed parallel file system

Sarp Oral, Feiyi Wang, David Dillow, Galen Shipman, Ross Miller, Oleg Drokin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

24 Scopus citations

Abstract

Journaling is a widely used technique to increase file system robustness against metadata and/or data corruptions. While the overhead of journaling can be masked by the page cache for small-scale, local file systems, we found that Lustre's use of journaling for the object store significantly impacted the overall performance of our large-scale center-wide parallel file system. By requiring that each write request wait for a journal transaction to commit, Lustre introduced serialization to the client request stream and imposed additional latency due to disk head movement (seeks) for each request. In this paper, we present the challenges we faced while deploying a very large scale production storage system. Our work provides a head-to-head comparison of two significantly different approaches to increasing the overall efficiency of the Lustre file system. First, we present a hardware solution using external journaling devices to eliminate the latencies incurred by the extra disk head seeks due to journaling. Second, we introduce a software-based optimization to remove the synchronous commit for each write request, side-stepping additional latency and amortizing the journal seeks across a much larger number of requests. Both solutions have been implemented and experimentally tested on our Spider storage system, a very large scale Lustre deployment. Our tests show both methods considerably improve the write performance, in some cases up to 93%. Testing with a real-world scientific application showed a 37% decrease in the number journal updates, each with an associated seek - which translated into an average I/O bandwidth improvement of 56.3%.

Original languageEnglish
Title of host publicationProceedings of FAST 2010
Subtitle of host publication8th USENIX Conference on File and Storage Technologies
PublisherUSENIX Association
Pages143-154
Number of pages12
ISBN (Electronic)9781931971744
StatePublished - 2010
Event8th USENIX Conference on File and Storage Technologies, FAST 2010 - San Jose, United States
Duration: Feb 23 2010Feb 26 2010

Publication series

NameProceedings of FAST 2010: 8th USENIX Conference on File and Storage Technologies

Conference

Conference8th USENIX Conference on File and Storage Technologies, FAST 2010
Country/TerritoryUnited States
CitySan Jose
Period02/23/1002/26/10

Funding

The research was sponsored by the Mathematical, Information, and Computational Sciences Division, Office of Advanced Scientific Computing Research, U.S. Department of Energy, under Contract No. DE-AC05-00OR22725 with UT-Battelle, LLC.

FundersFunder number
Computational Sciences Division
U.S. Department of EnergyDE-AC05-00OR22725
Advanced Scientific Computing Research

    Fingerprint

    Dive into the research topics of 'Efficient object storage journaling in a distributed parallel file system'. Together they form a unique fingerprint.

    Cite this