TY - GEN
T1 - Extending the OpenSHMEM memory model to support user-defined spaces
AU - Welch, Aaron
AU - Hernandez, Oscar
AU - Pophale, Swaroop
AU - Poole, Stephen
AU - Shamis, Pavel
AU - Chapman, Barbara
PY - 2014/10/6
Y1 - 2014/10/6
N2 - OpenSHMEM is an open standard for SHMEM libraries. With the standardisation process complete, the community is looking towards extending the API for increasing programmer flexibility and extreme scalability. According to the current OpenSHMEM specification (revision 1.1), allocation of symmetric memory is collective across all PEs executing the application. For better work sharing and memory utilisation, we are proposing the concepts of teams and spaces for OpenSHMEM that together allow allocation of memory only across user-specified teams. Through our implementation we show that by using teams we can confine memory allocation and usage to only the PEs that actually communicate via symmetric memory. We provide our preliminary results that demonstrate creating spaces for teams allows for less consumption of memory resources than the current alternative. We also examine the impact of our extensions on Scalable Synthetic Compact Applications #3(SSCA3), which is a sensor processing and knowledge formation kernel involving file I/O, and show that up to 30% of symmetric memory allocation can be eliminated without affecting the correctness of the benchmark. Copyright is held by the owner/author(s). Publication rights licensed to ACM.
AB - OpenSHMEM is an open standard for SHMEM libraries. With the standardisation process complete, the community is looking towards extending the API for increasing programmer flexibility and extreme scalability. According to the current OpenSHMEM specification (revision 1.1), allocation of symmetric memory is collective across all PEs executing the application. For better work sharing and memory utilisation, we are proposing the concepts of teams and spaces for OpenSHMEM that together allow allocation of memory only across user-specified teams. Through our implementation we show that by using teams we can confine memory allocation and usage to only the PEs that actually communicate via symmetric memory. We provide our preliminary results that demonstrate creating spaces for teams allows for less consumption of memory resources than the current alternative. We also examine the impact of our extensions on Scalable Synthetic Compact Applications #3(SSCA3), which is a sensor processing and knowledge formation kernel involving file I/O, and show that up to 30% of symmetric memory allocation can be eliminated without affecting the correctness of the benchmark. Copyright is held by the owner/author(s). Publication rights licensed to ACM.
UR - http://www.scopus.com/inward/record.url?scp=84939244256&partnerID=8YFLogxK
U2 - 10.1145/2676870.2676884
DO - 10.1145/2676870.2676884
M3 - Conference contribution
AN - SCOPUS:84939244256
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, PGAS 2014
A2 - Broman, David
A2 - Couture, Nadine
A2 - Bastien, Christian
A2 - Broman, David
A2 - Pepper, Peter
A2 - Dorta, Tomas
PB - Association for Computing Machinery
T2 - 8th International Conference on Partitioned Global Address Space Programming Models, PGAS 2014
Y2 - 6 October 2014 through 10 October 2014
ER -