Abstract
This paper studies Strassen's matrix multiplication algorithm by implementing it in a variety of methods: sequential, workflow, and in parallel. All the methods show better performance than the well-known scientific libraries for medium to large size matrices. The sequential recursive program is implemented and compared with ATLAS'S DGEMM subroutine. A workflow program in the NetSolve system and two parallel programs based on MPI and ScaLAPACK are also implemented. By analyzing the time complexity and memory requirement of each method, we provide insight into how to utilize Strassen's Algorithm to speedup matrix multiplication based on existing high performance tools or libraries.
Original language | English |
---|---|
Pages (from-to) | 415-421 |
Number of pages | 7 |
Journal | Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Systems |
State | Published - 2006 |
Externally published | Yes |
Event | 18th IASTED International Conference on Parallel and Distributed Computing and Systems, PDCS 2006 - Dallas, TX, United States Duration: Nov 13 2006 → Nov 15 2006 |
Keywords
- Matrix multiplication
- Parallel computing
- Strassen's algorithm