Abstract
Canonical Polyadic tensor decomposition using alternating Poisson regression (CP-APR) is an effective analysis tool for large sparse count datasets. One variant, which uses projected damped Newton optimization for the row subproblems (PDNR), offers quadratic convergence and is amenable to parallelization. Despite its potential effectiveness, PDNR performance on modern high performance computing (HPC) systems is not well understood. To remedy this, we have developed a parallel implementation of PDNR using Kokkos, a performance portable parallel programming framework that lets a single code base run efficiently on multiple HPC systems. We demonstrate that the performance of parallel PDNR can be poor if the load imbalance caused by the irregular distribution of nonzero entries in the tensor data is not addressed. Preliminary results using tensors from the FROSTT data set indicate that using multiple kernels to address this imbalance when solving the PDNR row subproblems in parallel can improve performance, with up to 80% speedup on CPUs and 10-fold speedup on NVIDIA GPUs.
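The load imbalance mentioned in the abstract can be illustrated with a small sketch. It is not the authors' Kokkos implementation: the function names, the threshold, and the toy tensor are all hypothetical, and the only assumption taken from the abstract is that each row subproblem's cost grows with the number of nonzeros in that row. The sketch counts nonzeros per row of one tensor mode and splits the rows into a light group and a heavy group, which is one plausible way that separate kernels could each receive work of similar size.

```python
from collections import Counter

def nonzeros_per_row(coords, mode):
    """Count the nonzeros that fall in each row index of the given mode."""
    return Counter(c[mode] for c in coords)

def split_rows_by_load(counts, threshold):
    """Partition row indices into light and heavy groups by nonzero count.

    The threshold is a hypothetical tuning parameter, not a value from the paper.
    """
    light = sorted(r for r, n in counts.items() if n <= threshold)
    heavy = sorted(r for r, n in counts.items() if n > threshold)
    return light, heavy

def imbalance(counts):
    """Ratio of the largest row load to the mean row load (1.0 means balanced)."""
    loads = list(counts.values())
    return max(loads) / (sum(loads) / len(loads))

# Toy 3-way sparse tensor in coordinate format: one (i, j, k) tuple per nonzero.
coords = [(0, 0, 0), (0, 1, 2), (0, 2, 1), (0, 3, 3), (0, 1, 1), (0, 2, 2),
          (1, 0, 1), (2, 3, 0)]

counts = nonzeros_per_row(coords, mode=0)
light, heavy = split_rows_by_load(counts, threshold=2)
print("light rows:", light, "heavy rows:", heavy)
print("imbalance ratio:", imbalance(counts))
```

In this toy case a single row holds six of the eight nonzeros, so a scheme that assigns one row per thread would leave most threads idle while one thread does most of the work.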
| Original language | English |
|---|---|
| Title of host publication | 2020 IEEE High Performance Extreme Computing Conference, HPEC 2020 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781728192192 |
| State | Published - Sep 22 2020 |
| Externally published | Yes |
| Event | 2020 IEEE High Performance Extreme Computing Conference, HPEC 2020, Virtual, Waltham, United States (Sep 21 2020 → Sep 25 2020) |
Publication series
| Name | 2020 IEEE High Performance Extreme Computing Conference, HPEC 2020 |
|---|---|
Conference
| Conference | 2020 IEEE High Performance Extreme Computing Conference, HPEC 2020 |
|---|---|
| Country/Territory | United States |
| City | Virtual, Waltham |
| Period | 09/21/20 → 09/25/20 |
Funding
Sandia National Laboratories is a multimission laboratory managed and operated by National Technology & Engineering Solutions of Sandia, LLC, a wholly owned subsidiary of Honeywell International Inc., for the U.S. Department of Energy’s National Nuclear Security Administration under contract DE-NA0003525.
Keywords
- GPU
- Kokkos
- Poisson factorization
- multicore
- tensor decomposition