IPMI-bascd efficient notification framework for large scale cluster computing

Chokchai Leangsuksun, Tirumala Rao, Anand Tikotekar, Stephen L. Scott, Richard Libby, Jeffrey S. Vetter, Yung Chin Fang, Hong Ong

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

The demand for an efficient fault tolerance system has led to the development of complex monitoring infrastructure, which in turn has created cm overwhelming task of data and event management. The increasing level of details at the hardware and software layer clearly affects the scalability and performance of monitoring and management tools. In this paper, we propose a problem notification framework that directly addresses the issue of monitor scalability. We first present the design and implementation of our step-by-step approach to analysing, filtering, and classifying the plethora of node statistics. Then, we present experimental results to show that our approach only needs minimal system resource and thus has low overhead. Finally, we introduce our web-based cluster management system that provides hardware controls at both cluster and nodal levels.

Original languageEnglish
Title of host publicationSixth IEEE International Symposium on Cluster Computing and the Grid Workshop, 2006. CCGRID 06
StatePublished - 2006
Event6th IEEE International Symposium on Cluster Computing and the Grid, 2006. CCGRID 06 - , Singapore
Duration: May 16 2006May 19 2006

Publication series

NameSixth IEEE International Symposium on Cluster Computing and the Grid Workshops, 2006. CCGRID 06

Conference

Conference6th IEEE International Symposium on Cluster Computing and the Grid, 2006. CCGRID 06
Country/TerritorySingapore
Period05/16/0605/19/06

Keywords

  • High-availability IPMI
  • Scalability

Fingerprint

Dive into the research topics of 'IPMI-bascd efficient notification framework for large scale cluster computing'. Together they form a unique fingerprint.

Cite this