Abstract
The symmetric rank-k update (SYRK) is a level-3 BLAS routine commonly used by many Data Mining/Machine Learning(DM/ML) algorithms such as regression, dimensionality reduction algorithms like PCA, matrix factorization and k-mean clustering. This paper presents a comprehensive analysis of the SYRK routine under popular dense linear algebra libraries such as OpenBLAS, Intel MKL, and BLIS particularly focusing on edge cases of dense matrices (thin or fat shapes) that are common in DM/ML applications. Our work identifies some performance issues of the SYRK routine in multi-threaded shared memory environments for edge cases and discuss matrix dependent modifications for performance improvement.
Original language | English |
---|---|
Title of host publication | 2018 21st International Conference of Computer and Information Technology, ICCIT 2018 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781538692424 |
DOIs | |
State | Published - Jul 2 2018 |
Event | 21st International Conference of Computer and Information Technology, ICCIT 2018 - Dhaka, Bangladesh Duration: Dec 21 2018 → Dec 23 2018 |
Publication series
Name | 2018 21st International Conference of Computer and Information Technology, ICCIT 2018 |
---|
Conference
Conference | 21st International Conference of Computer and Information Technology, ICCIT 2018 |
---|---|
Country/Territory | Bangladesh |
City | Dhaka |
Period | 12/21/18 → 12/23/18 |
Funding
This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Keywords
- BLAS
- Multicore
- Performance Issues of SYRK
- Scalability of AA