Multivariate geographic clustering in a metacomputing environment using globus

G. Mahinthakumar, Forrest M. Hoffman, William W. Hargrove, Nicholas T. Karonis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Scopus citations

Abstract

The authors present a metacomputing application of multivariate, nonhierarchical statistical clustering to geographic environmental data from the 48 conterminous United States in order to produce maps of regions of ecological similarity, called ecoregions. These maps represent finer scale regionalizations than do those generated by the traditional technique: an expert with a marker pen. Several variables (e.g., temperature, organic matter, rainfall etc.) thought to affect the growth of vegetation are clustered at resolutions as fine as one square kilometer (1 km2). These data can represent over 7.8 million map cells in an n-dimensional (n = 9 to 25) data space. A parallel version of the iterative statistical clustering algorithm is developed by the authors using the MPI (Message Passing Interface) message passing routines. The parallel algorithm uses a classical, self-scheduling, single-program, multiple data (SPMD) organization; performs dynamic load balancing for reasonable performance in heterogeneous metacomputing environments; and provides fault tolerance by saving intermediate results for easy restarts in case of hardware failure. The parallel algorithm was tested on various geographically distributed heterogeneous metacomputing configurations involving an IBM SP3™, an IBM SP2™, and two SGI Origin 2000™'s. The tests were performed with minimal code modification, and were made possible by Globus™ (a metacomputing software toolkit) and the Globus-enabled version of MPI (MPICH-G). Our performance tests indicate that while the algorithm works reasonably well under the metacomputing environment for a moderate number of processors, the communication overhead can become prohibitive for large processor configurations.

Original languageEnglish
Title of host publicationACM/IEEE SC 1999 Conference, SC 1999
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5
Number of pages1
ISBN (Electronic)1581130910, 9781581130911
DOIs
StatePublished - 1999
Event1999 ACM/IEEE Conference on Supercomputing, SC 1999 - Portland, United States
Duration: Nov 13 1999Nov 19 1999

Publication series

NameACM/IEEE SC 1999 Conference, SC 1999

Conference

Conference1999 ACM/IEEE Conference on Supercomputing, SC 1999
Country/TerritoryUnited States
CityPortland
Period11/13/9911/19/99

Funding

________________________ *The submitted manuscript has been authored by a contractor of the U.S. Government under contract No. DE–AC05–96OR22464. Accordingly, the U.S. Government retains a nonexclusive, royalty–free license to publish or reproduce the published form of this contribution, or allow others to do so, for U.S. Government purposes Oak Ridge National Laboratory, managed by Lockheed Martin Energy Research Corp. for the U.S. Department of Energy under contract number DE–AC05–96OR22464 Oak Ridge National Laboratory, managed by Lockheed Martin Energy Research Corp. for the U.S. Department of Energy under contract number DE-AC05-96OR22464

FundersFunder number
U.S. GovernmentDE–AC05–96OR22464
U.S. Department of EnergyDE-AC05-96OR22464

    Fingerprint

    Dive into the research topics of 'Multivariate geographic clustering in a metacomputing environment using globus'. Together they form a unique fingerprint.

    Cite this