TY - GEN
T1 - Symmetric active/active high availability for high-performance computing system services
T2 - CCGRID 2008 - 8th IEEE International Symposium on Cluster Computing and the Grid
AU - Engelmann, C.
AU - Scott, S. L.
AU - Leangsuksun, C.
AU - He, X.
PY - 2008
Y1 - 2008
N2 - This paper summarizes our efforts over the last 3-4 years in providing symmetric active/active high availability for high-performance computing (HPC) system services. This work paves the way for high-level reliability, availability and serviceability in extreme-scale HPC systems by focusing on the most critical components, head and service nodes, and by reinforcing them with appropriate high availability solutions. This paper presents our accomplishments in the form of concepts and respective prototypes, discusses existing limitations, outlines possible future work, and describes the relevance of this research to other, planned efforts.
AB - This paper summarizes our efforts over the last 3-4 years in providing symmetric active/active high availability for high-performance computing (HPC) system services. This work paves the way for high-level reliability, availability and serviceability in extreme-scale HPC systems by focusing on the most critical components, head and service nodes, and by reinforcing them with appropriate high availability solutions. This paper presents our accomplishments in the form of concepts and respective prototypes, discusses existing limitations, outlines possible future work, and describes the relevance of this research to other, planned efforts.
UR - http://www.scopus.com/inward/record.url?scp=50649113082&partnerID=8YFLogxK
U2 - 10.1109/CCGRID.2008.78
DO - 10.1109/CCGRID.2008.78
M3 - Conference contribution
AN - SCOPUS:50649113082
SN - 9780769531564
T3 - Proceedings CCGRID 2008 - 8th IEEE International Symposium on Cluster Computing and the Grid
SP - 813
EP - 818
BT - Proceedings CCGRID 2008 - 8th IEEE International Symposium on Cluster Computing and the Grid
Y2 - 19 May 2008 through 22 May 2008
ER -