Abstract
The Parallel Random Access Machines (PRAM) abstraction is the simplest and most elegant algorithmic model for the design and analysis of parallel algorithms. It consists of different models categorized based on the underlying memory access mode used, the most powerful of which is the Concurrent Read Concurrent Write (CRCW) model. A PRAM algorithm describes a series of rounds, each of which consists of a collection of operations that can be executed concurrently within the same time step. However, the lack of support for concurrent memory accesses and the prevalence of asynchronous programming models led to the belief that implementing CRCW PRAM algorithms is unattainable and prompted many to avoid this model except for theoretical studies of optimal performance. In this work, we study the arbitrary and common concurrent writes in the CRCW PRAM model and explore implementation challenges on general-purpose systems. Moreover, we examine current practices for implementing common/arbitrary concurrent writes and propose a new efficient lightweight and thread-safe method to implement concurrent writes through leveraging atomic instructions. To demonstrate the efficacy of our method, we developed OpenMP kernels for classical CRCW PRAM algorithms and provide experimental results and comparisons based on run time performance measured over the x86 multicore architecture. Our results show a performance speedup compared to current practices up to 4.5x across all our benchmarks.
Original language | English |
---|---|
Title of host publication | 50th International Conference on Parallel Processing Workshop, ICPP 2021 - Proceedings |
Publisher | Association for Computing Machinery |
ISBN (Electronic) | 9781450384414 |
DOIs | |
State | Published - Aug 9 2021 |
Event | 50th International Conference on Parallel Processing Workshop, ICPP 2021 - Virtual, Online, United States Duration: Aug 9 2021 → Aug 12 2021 |
Publication series
Name | ACM International Conference Proceeding Series |
---|
Conference
Conference | 50th International Conference on Parallel Processing Workshop, ICPP 2021 |
---|---|
Country/Territory | United States |
City | Virtual, Online |
Period | 08/9/21 → 08/12/21 |
Funding
This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.
Keywords
- Arbitrary concurrent writes
- Crcw pram
- Parallel algorithms
- Parallel architectures
- Write-conflict resolution