Abstract
Deep machine learning offers a comprehensive framework for extracting meaningful features from complex observations in an unsupervised manner. The majority of deep learning architectures described in the literature primarily focus on extracting spatial features. However, in real-world settings, capturing temporal dependencies in observations is critical for accurate inference. This paper introduces an enhancement to DeSTIN - a compositional deep learning architecture in which each layer consists of multiple instantiations of a common node - that learns to represent spatiotemporal patterns in data based on a novel recurrent clustering algorithm. Contrary to mainstream deep architectures, such as deep belief networks where layer-by-layer training is assumed, each of the nodes in the proposed architecture is trained independently and in parallel. Moreover, top-down and bottom-up information flows facilitate rich feature formation. A semi-supervised setting is demonstrated achieving state-of-the-art results on the MNIST classification benchmarks. A GPU implementation is discussed further accentuating the scalability properties of the proposed framework.
Original language | English |
---|---|
Pages (from-to) | 115-123 |
Number of pages | 9 |
Journal | Pattern Recognition Letters |
Volume | 37 |
Issue number | 1 |
DOIs | |
State | Published - Feb 1 2014 |
Externally published | Yes |
Funding
This work was supported by the Intelligence Advanced Research Projects Activity (IARPA) via Army Research Office (ARO) agreement No. W911NF-12-1-0017. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon.
Funders | Funder number |
---|---|
Army Research Office | |
Intelligence Advanced Research Projects Activity |
Keywords
- Deep machine learning
- Online clustering
- Pattern recognition
- Recurrent clustering
- Spatiotemporal signals
- Unsupervised feature extraction