TY - GEN
T1 - An intelligent bandwidth manager for CNN applications on embedded devices
AU - Kumar Pasupuleti, Sirish
AU - Rajaram, Aishwarya
AU - Rao Miniskar, Narasinga
AU - Narayana Gadde, Raj
AU - Yadvandu, Deepanshu
AU - Rajagopal, Vasanthakumar
AU - Vishnoi, Ashok
AU - Kumar Ramasamy, Chandra
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/8/29
Y1 - 2018/8/29
N2 - Adapting complex Convolution Neural Network (CNN) applications on embedded processors is a challenge due to the massive memory bandwidth and computational requirements. In particular, the CNN memory bandwidth requirement poses a huge challenge for the processors with Scratch Pad Memory (SPM), usually of limited size. In this paper, we present an Intelligent Bandwidth Manager (IBWM) to efficiently handle the CNN bandwidth for SPM based processors. The proposed IBWM is a two fold approach which includes Intelligent SPM Manager (ISM) to optimize the number of accesses to SDRAM by analysing the data patterns, and Feature Map Compression (FMC) to further reduce the bandwidth by exploiting the feature map data sparsity. The IBWM is independent of any processor architecture and can be adopted in any processor with SPM. The proposed IBWM is experimented with ResNet-50 [1] and AlexNet [2] networks on a Samsung Reconfigurable Processor (SRP) [3] for various SPM sizes. The SDRAM bandwidth results show, 2x improvement compared to MIT Eyeriss [4] for AlexNet, and 4x-8x improvement compared to primitive bandwidth management techniques for AlexNet and ResNet-50. The proposed method achieves the bandwidth closer to the minimum possible bandwidth.
AB - Adapting complex Convolution Neural Network (CNN) applications on embedded processors is a challenge due to the massive memory bandwidth and computational requirements. In particular, the CNN memory bandwidth requirement poses a huge challenge for the processors with Scratch Pad Memory (SPM), usually of limited size. In this paper, we present an Intelligent Bandwidth Manager (IBWM) to efficiently handle the CNN bandwidth for SPM based processors. The proposed IBWM is a two fold approach which includes Intelligent SPM Manager (ISM) to optimize the number of accesses to SDRAM by analysing the data patterns, and Feature Map Compression (FMC) to further reduce the bandwidth by exploiting the feature map data sparsity. The IBWM is independent of any processor architecture and can be adopted in any processor with SPM. The proposed IBWM is experimented with ResNet-50 [1] and AlexNet [2] networks on a Samsung Reconfigurable Processor (SRP) [3] for various SPM sizes. The SDRAM bandwidth results show, 2x improvement compared to MIT Eyeriss [4] for AlexNet, and 4x-8x improvement compared to primitive bandwidth management techniques for AlexNet and ResNet-50. The proposed method achieves the bandwidth closer to the minimum possible bandwidth.
KW - CNN bandwidth management
KW - Convolution Neural Networks
KW - Feature map compression
UR - http://www.scopus.com/inward/record.url?scp=85062909645&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2018.8451706
DO - 10.1109/ICIP.2018.8451706
M3 - Conference contribution
AN - SCOPUS:85062909645
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 4173
EP - 4177
BT - 2018 IEEE International Conference on Image Processing, ICIP 2018 - Proceedings
PB - IEEE Computer Society
T2 - 25th IEEE International Conference on Image Processing, ICIP 2018
Y2 - 7 October 2018 through 10 October 2018
ER -