Fast rule mining over multi-dimensional windows

Mahashweta Das, P. Deepak, Prasad M. Deshpande, Ramakrishnan Kannan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Association rule mining is an indispensable tool for discovering insights from large databases and data warehouses. The data in a warehouse being multi-dimensional, it is often useful to mine rules over subsets of data defined by selections over the dimensions. Such interactive rule mining over multi-dimensional query windows is difficult since rule mining is computationally expensive. Current methods using pre-computation of frequent itemsets require counting of some itemsets by revisiting the transaction database at query time, which is very expensive. We develop a method (RMW) that identifies the minimal set of itemsets to compute and store for each cell, so that rule mining over any query window may be performed without going back to the transaction database. We give formal proofs that the set of itemsets chosen by RMW is sufficient to answer any query and also prove that it is the optimal set to be computed for 1 dimensional queries. We demonstrate through an extensive empirical evaluation that RMW achieves extremely fast query response time compared to existing methods, with only moderate overhead in pre-computation and storage.

Original languageEnglish
Title of host publicationProceedings of the 11th SIAM International Conference on Data Mining, SDM 2011
PublisherSociety for Industrial and Applied Mathematics Publications
Pages582-593
Number of pages12
ISBN (Print)9780898719925
DOIs
StatePublished - 2011
Externally publishedYes
Event11th SIAM International Conference on Data Mining, SDM 2011 - Mesa, AZ, United States
Duration: Apr 28 2011Apr 30 2011

Publication series

NameProceedings of the 11th SIAM International Conference on Data Mining, SDM 2011

Conference

Conference11th SIAM International Conference on Data Mining, SDM 2011
Country/TerritoryUnited States
CityMesa, AZ
Period04/28/1104/30/11

Fingerprint

Dive into the research topics of 'Fast rule mining over multi-dimensional windows'. Together they form a unique fingerprint.

Cite this