Integrated Clustering and Anomaly Detection (INCAD) for Streaming Data

Sreelekha Guggilam, Syed Mohammed Arshad Zaidi, Varun Chandola, Abani K. Patra

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Most current clustering based anomaly detection methods use scoring schema and thresholds to classify anomalies. These methods are often tailored to target specific data sets with “known” number of clusters. The paper provides a streaming clustering and anomaly detection algorithm that does not require strict arbitrary thresholds on the anomaly scores or knowledge of the number of clusters while performing probabilistic anomaly detection and clustering simultaneously. This ensures that the cluster formation is not impacted by the presence of anomalous data, thereby leading to more reliable definition of “normal vs abnormal” behavior. The motivations behind developing the INCAD model [17] and the path that leads to the streaming model are discussed.

Original languageEnglish
Title of host publicationComputational Science – ICCS 2019 - 19th International Conference, Proceedings
EditorsJoão M.F. Rodrigues, Pedro J.S. Cardoso, Jânio Monteiro, Roberto Lam, Valeria V. Krzhizhanovskaya, Michael H. Lees, Peter M.A. Sloot, Jack J. Dongarra
PublisherSpringer Verlag
Pages45-59
Number of pages15
ISBN (Print)9783030227463
DOIs
StatePublished - 2019
Externally publishedYes
Event19th International Conference on Computational Science, ICCS 2019 - Faro, Portugal
Duration: Jun 12 2019Jun 14 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11539 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Computational Science, ICCS 2019
Country/TerritoryPortugal
CityFaro
Period06/12/1906/14/19

Funding

Acknowledgements. The authors would like to acknowledge University at Buffalo Center for Computational Research (http://www.buffalo.edu/ccr.html) for its computing resources that were made available for conducting the research reported in this paper. Financial support of the National Science Foundation Grant numbers NSF/OAC 1339765 and NSF/DMS 1621853 is acknowledged. The authors would like to acknowledge University at Buffalo Center for Computational Research (http://www.buffalo.edu/ccr.html) for its computing resources that were made available for conducting the research reported in this paper. Financial support of the National Science Foundation Grant numbers NSF/OAC 1339765 and NSF/DMS 1621853 is acknowledged.

FundersFunder number
NSF/DMS1621853
NSF/OAC
National Science Foundation
Directorate for Computer and Information Science and Engineering1339765
University at Buffalo

    Keywords

    • Anomaly detection
    • Bayesian non-parametric models
    • Clustering based anomaly detection
    • Extreme value theory

    Fingerprint

    Dive into the research topics of 'Integrated Clustering and Anomaly Detection (INCAD) for Streaming Data'. Together they form a unique fingerprint.

    Cite this