TY - GEN
T1 - Multi-GPU acceleration of DARTEL (early detection of Alzheimer)
AU - Valero-Lara, Pedro
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/11/26
Y1 - 2014/11/26
N2 - Medical image processing is becoming a significant discipline within the bioinformatic community. In particular, deformable registration methods are one of the most sophisticate and important lines of research within biomedical image processing, due to the valuable information provided. However, these methods consume considerable processing time, power consumption and require high amounts of memory. Current Graphics Processing Units (GPU) have a high number of cores and high memory bandwidth, providing an excellent platform for reducing the cost of these methods in terms of processing time and power consumption. This work proposes several Graphics Processing Units GPU-based implementations of one of the most sophisticated deformable registration algorithms, DARTEL. The main contribution consists of a new GPU approach, which considerably reduces the overhead caused by memory transfers and the computational cost required by the parallelization of DARTEL. Furthermore, the use of multiple (2 and 4) GPUs is studied, achieving favorable results. This new approach provides a high speedup with respect to the sequential counterpart. Finally, the experimental results show a processing time reduction of more than 3 hours in typical cases of study. Additionally, this new approach significantly reduces power consumption.
AB - Medical image processing is becoming a significant discipline within the bioinformatic community. In particular, deformable registration methods are one of the most sophisticate and important lines of research within biomedical image processing, due to the valuable information provided. However, these methods consume considerable processing time, power consumption and require high amounts of memory. Current Graphics Processing Units (GPU) have a high number of cores and high memory bandwidth, providing an excellent platform for reducing the cost of these methods in terms of processing time and power consumption. This work proposes several Graphics Processing Units GPU-based implementations of one of the most sophisticated deformable registration algorithms, DARTEL. The main contribution consists of a new GPU approach, which considerably reduces the overhead caused by memory transfers and the computational cost required by the parallelization of DARTEL. Furthermore, the use of multiple (2 and 4) GPUs is studied, achieving favorable results. This new approach provides a high speedup with respect to the sequential counterpart. Finally, the experimental results show a processing time reduction of more than 3 hours in typical cases of study. Additionally, this new approach significantly reduces power consumption.
KW - Biomedical brain-image processing
KW - CUDA
KW - DARTEL
KW - GPU
KW - deformable registration
UR - http://www.scopus.com/inward/record.url?scp=84917706060&partnerID=8YFLogxK
U2 - 10.1109/CLUSTER.2014.6968783
DO - 10.1109/CLUSTER.2014.6968783
M3 - Conference contribution
AN - SCOPUS:84917706060
T3 - 2014 IEEE International Conference on Cluster Computing, CLUSTER 2014
SP - 346
EP - 354
BT - 2014 IEEE International Conference on Cluster Computing, CLUSTER 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th IEEE International Conference on Cluster Computing, CLUSTER 2014
Y2 - 22 September 2014 through 26 September 2014
ER -