Abstract
The Cray X1 supercomputer's distributed shared memory presents a 64-bit global address space that is directly addressable from every MSP with an interconnect bandwidth per computation rate of 1 byte/flop. Our results show that this high bandwidth and low latency for remote memory accesses translate into improved application performance on important applications.
Original language | English |
---|---|
Pages (from-to) | 30-40 |
Number of pages | 11 |
Journal | IEEE Micro |
Volume | 25 |
Issue number | 1 |
DOIs | |
State | Published - Jan 2005 |
Funding
We gratefully acknowledge Cray Inc. for its ongoing cooperation and, in particular, Steve Scott, James Schwarzmeier, and Nathan Wichmann. The work described in this article was sponsored by the Office of Mathematical, Information, and Computational Sciences, Office of Science, US Department of Energy under Contract No. DE-AC05-00OR22725 with UT-Battelle, LLC. Accordingly, the US government retains a nonexclusive, royalty-free license to publish or reproduce the published form of this contribution, or allow others to do so, for US Government purposes.