Group file operations for scalable tools and middleware

Michael J. Brim, Barton P. Miller

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

Group file operations are a new, intuitive idiom for tools and middleware - including parallel debuggers and runtimes, performance measurement and steering, and distributed resource management - that require scalable operations on large groups of distributed files. The idiom provides new semantics for using file groups in standard file operations to eliminate costly iteration. A file-based idiom promotes conciseness and portability, and eases adoption. With explicit semantics for aggregation of group results, the idiom addresses a key scalability barrier. We have designed TBON-FS, a new distributed file system that provides scalable group file operations by leveraging tree-based overlay networks (TBONs) for scalable communication and data aggregation. We integrated group file operations into several tools: parallel versions of common utilities including cp, grep, rsync, tail, and top, and the Ganglia Distributed Monitoring System. Our experience verifies that the group file operation idiom is intuitive, easily adopted, and enables a wide variety of tools to run efficiently at scale.
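The idea of replacing per-file iteration with one group operation whose results are aggregated in a tree can be illustrated with a minimal sketch. It is not the TBON-FS interface: the names `group_read`, `tree_aggregate`, and `demo` are hypothetical, the "distributed" files are local temporary files, and the tree reduction is simulated in a single process.

```python
import os
import tempfile
from typing import Callable, List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def group_read(paths: Sequence[str]) -> List[bytes]:
    """Hypothetical group read: one call returns the contents of every member file.

    This sketch loops locally; a real system would issue the reads in
    parallel on the hosts that own the files.
    """
    results = []
    for path in paths:
        with open(path, "rb") as f:
            results.append(f.read())
    return results


def tree_aggregate(values: Sequence[T], combine: Callable[[T, T], T], fanout: int = 2) -> T:
    """Reduce values level by level, the way a tree with the given fan-out would.

    `combine` must be associative (for example sum or max) so that the
    grouping imposed by the tree does not change the result.
    """
    if not values:
        raise ValueError("cannot aggregate an empty group")
    if fanout < 2:
        raise ValueError("fanout must be at least 2")
    level = list(values)
    while len(level) > 1:
        next_level = []
        for i in range(0, len(level), fanout):
            chunk = level[i:i + fanout]
            acc = chunk[0]
            for value in chunk[1:]:
                acc = combine(acc, value)
            next_level.append(acc)  # one partial result per simulated tree node
        level = next_level
    return level[0]


def demo() -> Tuple[int, int]:
    """Write five small files, read them as a group, and aggregate the values."""
    with tempfile.TemporaryDirectory() as tmp:
        paths = []
        for n in range(1, 6):
            path = os.path.join(tmp, f"host{n}.txt")
            with open(path, "w") as f:
                f.write(f"{n}\n")
            paths.append(path)
        values = [int(data) for data in group_read(paths)]
    return tree_aggregate(values, max), tree_aggregate(values, lambda a, b: a + b)


if __name__ == "__main__":
    largest, total = demo()
    print(largest, total)
```

The caller sees a single read and a single aggregated result, regardless of how many files belong to the group; only the aggregation function changes between tools.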

Original language: English
Title of host publication: 16th International Conference on High Performance Computing, HiPC 2009 - Proceedings
Pages: 69-78
Number of pages: 10
State: Published - 2009
Externally published: Yes
Event: 16th International Conference on High Performance Computing, HiPC 2009 - Kochi, India
Duration: Dec 16 2009 → Dec 19 2009

Publication series

Name: 16th International Conference on High Performance Computing, HiPC 2009 - Proceedings

Conference

Conference: 16th International Conference on High Performance Computing, HiPC 2009
Country/Territory: India
City: Kochi
Period: 12/16/09 → 12/19/09

Keywords

  • Distributed
  • Group file
  • Scalable, aggregation
  • Tools
