TY - GEN
T1 - Experiences in extending Parallware to support OpenACC
AU - Lobeiras, Jacobo
AU - Arenaz, Manuel
AU - Hernández, Oscar
N1 - Publisher Copyright:
© 2015 ACM.
PY - 2015/11/15
Y1 - 2015/11/15
N2 - Porting scientific codes to accelerator-based computers using OpenACC and OpenMP is an important topic for the HPC community. Programmability, performance portability and developer productivity are key issues for the widespread use of these systems. In the scope of general-purpose parallel computing, Parallware is a new commercial OpenMPenabling source-to-source compiler that automatically adds OpenMP capabilities in scientific programs. Thus, extending Parallware with OpenACC or OpenMP 4.x support would contribute to improve programmability and developer productivity. In contrast, the performance portability of such approach needs to be demonstrated in practice. This paper presents a preliminary study to extend Parallware with OpenACC support for GPU devices. A simple benchmark suite has been designed to mimic important features and computational patterns of real scientific applications. Handcoded OpenACC versions are compared to OpenMP versions automatically generated by Parallware. Performance is evaluated with the PGI OpenACC compiler on systems accelerated with NVIDIA GPUS.
AB - Porting scientific codes to accelerator-based computers using OpenACC and OpenMP is an important topic for the HPC community. Programmability, performance portability and developer productivity are key issues for the widespread use of these systems. In the scope of general-purpose parallel computing, Parallware is a new commercial OpenMPenabling source-to-source compiler that automatically adds OpenMP capabilities in scientific programs. Thus, extending Parallware with OpenACC or OpenMP 4.x support would contribute to improve programmability and developer productivity. In contrast, the performance portability of such approach needs to be demonstrated in practice. This paper presents a preliminary study to extend Parallware with OpenACC support for GPU devices. A simple benchmark suite has been designed to mimic important features and computational patterns of real scientific applications. Handcoded OpenACC versions are compared to OpenMP versions automatically generated by Parallware. Performance is evaluated with the PGI OpenACC compiler on systems accelerated with NVIDIA GPUS.
KW - Compiler-assisted parallelization
KW - Openacc
KW - Openmp
KW - Parallware
UR - https://www.scopus.com/pages/publications/84959386268
U2 - 10.1145/2832105.2832112
DO - 10.1145/2832105.2832112
M3 - Conference contribution
AN - SCOPUS:84959386268
T3 - Proceedings of WACCPD 2015: 2nd Workshop on Accelerator Programming Using Directives - Held in conjunction with SC 2015: The International Conference for High Performance Computing, Networking, Storage and Analysis
BT - Proceedings of WACCPD 2015
PB - Association for Computing Machinery
T2 - 2nd Workshop on Accelerator Programming Using Directives, WACCPD 2015 Held in conjunction with The International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2015
Y2 - 16 November 2015 through 16 November 2015
ER -