TY - JOUR
T1 - Improvement of parallelization efficiency of batch pattern BP training algorithm using Open MPI
AU - Turchenko, Volodymyr
AU - Grandinetti, Lucio
AU - Bosilca, George
AU - Dongarra, Jack J.
PY - 2010
Y1 - 2010
N2 - The use of tuned collective's module of Open MPI to improve a parallelization efficiency of parallel batch pattern back propagation training algorithm of a multilayer perceptron is considered in this paper. The multilayer perceptron model and the usual sequential batch pattern training algorithm are theoretically described. An algorithmic description of a parallel version of the batch pattern training method is introduced. The obtained parallelization efficiency results using Open MPI tuned collective's module and MPICH2 are compared. Our results show that (i) Open MPI tuned collective's module outperforms MPICH2 implementation both on SMP computer and computational cluster and (ii) different internal algorithms of MPI-Allreduce() collective operation give better results on different scenarios and different parallel systems. Therefore the properties of the communication network and user application should be taken into account when a specific collective algorithm is used.
AB - The use of tuned collective's module of Open MPI to improve a parallelization efficiency of parallel batch pattern back propagation training algorithm of a multilayer perceptron is considered in this paper. The multilayer perceptron model and the usual sequential batch pattern training algorithm are theoretically described. An algorithmic description of a parallel version of the batch pattern training method is introduced. The obtained parallelization efficiency results using Open MPI tuned collective's module and MPICH2 are compared. Our results show that (i) Open MPI tuned collective's module outperforms MPICH2 implementation both on SMP computer and computational cluster and (ii) different internal algorithms of MPI-Allreduce() collective operation give better results on different scenarios and different parallel systems. Therefore the properties of the communication network and user application should be taken into account when a specific collective algorithm is used.
KW - Multilayer perceptron
KW - Open MPI
KW - Parallelization efficiency
KW - Tuned collective's module
UR - http://www.scopus.com/inward/record.url?scp=78650277770&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2010.04.056
DO - 10.1016/j.procs.2010.04.056
M3 - Article
AN - SCOPUS:78650277770
SN - 1877-0509
VL - 1
SP - 525
EP - 533
JO - Procedia Computer Science
JF - Procedia Computer Science
IS - 1
ER -