Abstract
Projecting a high-dimensional dataset onto a lower dimensional space can improve the efficiency of knowledge discovery and facilitate real-time data analysis. One technique for dimension reduction, weighted multi-dimensional scaling (WMDS), approximately preserves pairwise weighted distances during the transformation; but its O(f(n)d) algorithm impedes real-time performance on large datasets. Thus, we present CLARET, our fast and portable parallel WMDS tool that combines algorithmic concepts adapted and extended from the stochastic force-based MDS (SF-MDS) and Glimmer. To further improve Claret's performance for real-time data analysis, we propose a preprocessing step that computes approximate weighted Euclidean distances by combining a novel data mapping called stretching and Johnson Lindestrauss' lemma in O(log d) time in place of the original O(d) time. This preprocessing step reduces the complexity of WMDS from O(f(n)d) to O(f(n) log d), which for large d is a significant computational gain. Finally, we present a case study of Claret by integrating it into an interactive visualization tool called V2PI to facilitate real-time analytics. To ensure the quality of the projections, we propose a geometric shape matching-based alignment process and a quality metric.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 10-17 |
| Number of pages | 8 |
| ISBN (Electronic) | 9781538625880 |
| DOIs | |
| State | Published - Jul 2 2017 |
| Externally published | Yes |
| Event | 19th IEEE Intl Conference on High Performance Computing and Communications, 15th IEEE Intl Conference on Smart City, and 3rd IEEE Intl Conference on Data Science and Systems, HPCC/SmartCity/DSS 2017 - Bangkok, Thailand Duration: Dec 18 2017 → Dec 20 2017 |
Publication series
| Name | Proceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017 |
|---|---|
| Volume | 2018-January |
Conference
| Conference | 19th IEEE Intl Conference on High Performance Computing and Communications, 15th IEEE Intl Conference on Smart City, and 3rd IEEE Intl Conference on Data Science and Systems, HPCC/SmartCity/DSS 2017 |
|---|---|
| Country/Territory | Thailand |
| City | Bangkok |
| Period | 12/18/17 → 12/20/17 |
Funding
This research was supported in part by NSF grant IIS-1447416, a grant from General Dynamics Mission Systems, and a seed grant from the SEEC Center, supported by ICTAS at Virginia Tech. This research was supported in part by NSF grant IIS- 1447416, a grant from General Dynamics Mission Systems, and a seed grant from the SEEC Center, supported by ICTAS at Virginia Tech.
Keywords
- OpenCL
- multi-dimensional scaling (MDS)
- visual analytics
- weighted multi-dimensional scaling (WMDS)