TY - GEN
T1 - A 1TOPS/W analog deep machine-learning engine with floating-gate storage in 0.13µm CMOS
AU - Lu, Junjie
AU - Young, Steven
AU - Arel, Itamar
AU - Holleman, Jeremy
PY - 2014
Y1 - 2014
N2 - Direct processing of raw high-dimensional data such as images and video by machine learning systems is impractical both due to prohibitive power consumption and the "curse of dimensionality," which makes learning tasks exponentially more difficult as dimension increases. Deep machine learning (DML) mimics the hierarchical presentation of information in the human brain to achieve robust automated feature extraction, reducing the dimension of such data. However, the computational complexity of DML systems limits large-scale implementations in standard digital computers. Custom analog or mixed-mode signal processors have been reported to yield much higher energy efficiency than DSP [1-4], presenting the means of overcoming these limitations. However, the use of volatile digital memory in [1-3] precludes their use in intermittently-powered devices, and the required interfacing and internal A/D/A conversions add power and area overhead. Nonvolatile storage is employed in [4], but the lack of learning capability requires task-specific programming before operation, and precludes online adaptation.
AB - Direct processing of raw high-dimensional data such as images and video by machine learning systems is impractical both due to prohibitive power consumption and the "curse of dimensionality," which makes learning tasks exponentially more difficult as dimension increases. Deep machine learning (DML) mimics the hierarchical presentation of information in the human brain to achieve robust automated feature extraction, reducing the dimension of such data. However, the computational complexity of DML systems limits large-scale implementations in standard digital computers. Custom analog or mixed-mode signal processors have been reported to yield much higher energy efficiency than DSP [1-4], presenting the means of overcoming these limitations. However, the use of volatile digital memory in [1-3] precludes their use in intermittently-powered devices, and the required interfacing and internal A/D/A conversions add power and area overhead. Nonvolatile storage is employed in [4], but the lack of learning capability requires task-specific programming before operation, and precludes online adaptation.
UR - http://www.scopus.com/inward/record.url?scp=84898062031&partnerID=8YFLogxK
U2 - 10.1109/ISSCC.2014.6757532
DO - 10.1109/ISSCC.2014.6757532
M3 - Conference contribution
AN - SCOPUS:84898062031
SN - 9781479909186
T3 - Digest of Technical Papers - IEEE International Solid-State Circuits Conference
SP - 504
EP - 505
BT - 2014 IEEE International Solid-State Circuits Conference, ISSCC 2014 - Digest of Technical Papers
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 61st IEEE International Solid-State Circuits Conference, ISSCC 2014
Y2 - 9 February 2014 through 13 February 2014
ER -