Portable Parallel Design of Weighted Multi-Dimensional Scaling for Real-Time Data Analysis

Sajal Dash, Anshuman Verma, Chris North, Wu Chun Feng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Projecting a high-dimensional dataset onto a lower dimensional space can improve the efficiency of knowledge discovery and facilitate real-time data analysis. One technique for dimension reduction, weighted multi-dimensional scaling (WMDS), approximately preserves pairwise weighted distances during the transformation; but its O(f(n)d) algorithm impedes real-time performance on large datasets. Thus, we present CLARET, our fast and portable parallel WMDS tool that combines algorithmic concepts adapted and extended from the stochastic force-based MDS (SF-MDS) and Glimmer. To further improve Claret's performance for real-time data analysis, we propose a preprocessing step that computes approximate weighted Euclidean distances by combining a novel data mapping called stretching and Johnson Lindestrauss' lemma in O(log d) time in place of the original O(d) time. This preprocessing step reduces the complexity of WMDS from O(f(n)d) to O(f(n) log d), which for large d is a significant computational gain. Finally, we present a case study of Claret by integrating it into an interactive visualization tool called V2PI to facilitate real-time analytics. To ensure the quality of the projections, we propose a geometric shape matching-based alignment process and a quality metric.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages10-17
Number of pages8
ISBN (Electronic)9781538625880
DOIs
StatePublished - Jul 2 2017
Externally publishedYes
Event19th IEEE Intl Conference on High Performance Computing and Communications, 15th IEEE Intl Conference on Smart City, and 3rd IEEE Intl Conference on Data Science and Systems, HPCC/SmartCity/DSS 2017 - Bangkok, Thailand
Duration: Dec 18 2017Dec 20 2017

Publication series

NameProceedings - 2017 IEEE 19th Intl Conference on High Performance Computing and Communications, HPCC 2017, 2017 IEEE 15th Intl Conference on Smart City, SmartCity 2017 and 2017 IEEE 3rd Intl Conference on Data Science and Systems, DSS 2017
Volume2018-January

Conference

Conference19th IEEE Intl Conference on High Performance Computing and Communications, 15th IEEE Intl Conference on Smart City, and 3rd IEEE Intl Conference on Data Science and Systems, HPCC/SmartCity/DSS 2017
Country/TerritoryThailand
CityBangkok
Period12/18/1712/20/17

Funding

This research was supported in part by NSF grant IIS-1447416, a grant from General Dynamics Mission Systems, and a seed grant from the SEEC Center, supported by ICTAS at Virginia Tech. This research was supported in part by NSF grant IIS- 1447416, a grant from General Dynamics Mission Systems, and a seed grant from the SEEC Center, supported by ICTAS at Virginia Tech.

FundersFunder number
General Dynamics Mission Systems
SEEC Center
National Science Foundation1447416, IIS-1447416
Institute for Critical Technology and Applied Science
Norsk Sykepleierforbund

    Keywords

    • OpenCL
    • multi-dimensional scaling (MDS)
    • visual analytics
    • weighted multi-dimensional scaling (WMDS)

    Fingerprint

    Dive into the research topics of 'Portable Parallel Design of Weighted Multi-Dimensional Scaling for Real-Time Data Analysis'. Together they form a unique fingerprint.

    Cite this