Performance portability study of epistasis detection using SYCL on NVIDIA GPU

Zheming Jin, Jeffrey S. Vetter

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

We describe the experience of converting a CUDA implementation of a high-order epistasis detection algorithm to SYCL. The goals are for our work to be useful to application and compiler developers with a detailed description of migration paths between CUDA and SYCL. Evaluating the CUDA and SYCL applications on an NVIDIA V100 GPU, we find that the optimization of loop unrolling needs to be applied manually to the SYCL kernel for obtaining comparable performance. The performance of the SYCL group reduce function, an alternative to the CUDA warp-based reduction, depends on the problem and work group sizes. The 64-bit popcount operation implemented with tree of adders is slightly faster than the built-in popcount operation. When the number of OpenMP threads is four, the highest performance of the SYCL and CUDA applications are comparable.

Original languageEnglish
Title of host publicationProceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2022
PublisherAssociation for Computing Machinery, Inc
ISBN (Electronic)9781450393867
DOIs
StatePublished - Aug 7 2022
Event13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2022 - Chicago, United States
Duration: Aug 7 2022Aug 8 2022

Publication series

NameProceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2022

Conference

Conference13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, BCB 2022
Country/TerritoryUnited States
CityChicago
Period08/7/2208/8/22

Funding

This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).

FundersFunder number
U.S. Department of Energy

    Keywords

    • Epistasis
    • GPU
    • Portability
    • Programming model

    Fingerprint

    Dive into the research topics of 'Performance portability study of epistasis detection using SYCL on NVIDIA GPU'. Together they form a unique fingerprint.

    Cite this