TY - GEN
T1 - Fast tree-based algorithms for DBSCAN for low-dimensional data on GPUs
AU - Prokopenko, Andrey
AU - Lebrun-Grandié, Damien
AU - Arndt, Daniel
N1 - Publisher Copyright:
© 2023 Association for Computing Machinery. All rights reserved.
PY - 2023/8/7
Y1 - 2023/8/7
N2 - DBSCAN is a well-known density-based clustering algorithm to discover arbitrary shape clusters. While conceptually simple in serial, the algorithm is challenging to efficiently parallelize on many-core GPU architectures. Common pitfalls, such as asynchronous range query calls, result in high thread execution divergence in many implementations. In this paper, we propose a new framework for GPU-accelerated DBSCAN, and describe two tree-based algorithms within that framework. Both algorithms fuse the search for neighbors with updating cluster information, but differ in their treatment of dense regions of the data. We show that the time taken to compute clusters is at most twice that of determination of the neighbors. We compare the proposed algorithms with existing CPU and GPU implementations, and demonstrate their competitiveness and performance using a fast traversal structure (bounding volume hierarchy) for low dimensional data. We also show that the memory usage can be reduced by processing object neighbors dynamically without storing them.
AB - DBSCAN is a well-known density-based clustering algorithm to discover arbitrary shape clusters. While conceptually simple in serial, the algorithm is challenging to efficiently parallelize on many-core GPU architectures. Common pitfalls, such as asynchronous range query calls, result in high thread execution divergence in many implementations. In this paper, we propose a new framework for GPU-accelerated DBSCAN, and describe two tree-based algorithms within that framework. Both algorithms fuse the search for neighbors with updating cluster information, but differ in their treatment of dense regions of the data. We show that the time taken to compute clusters is at most twice that of determination of the neighbors. We compare the proposed algorithms with existing CPU and GPU implementations, and demonstrate their competitiveness and performance using a fast traversal structure (bounding volume hierarchy) for low dimensional data. We also show that the memory usage can be reduced by processing object neighbors dynamically without storing them.
KW - DBSCAN
KW - GPU
KW - bounding volume hierarchy
KW - parallel algorithm
UR - http://www.scopus.com/inward/record.url?scp=85179895067&partnerID=8YFLogxK
U2 - 10.1145/3605573.3605594
DO - 10.1145/3605573.3605594
M3 - Conference contribution
AN - SCOPUS:85179895067
T3 - ACM International Conference Proceeding Series
SP - 503
EP - 512
BT - 52nd International Conference on Parallel Processing, ICPP 2023 - Main Conference Proceedings
PB - Association for Computing Machinery
T2 - 52nd International Conference on Parallel Processing, ICPP 2023
Y2 - 7 August 2023 through 10 August 2023
ER -