Abstract
Anomaly detection for time series data is often aimed at identifying extreme behaviors within an individual time series. However, identifying extreme trends relative to a collection of other time series is of significant interest, like in the fields of public health policy, social justice and pandemic propagation. We propose an algorithm that can scale to large collections of time series data using the concepts from the theory of large deviations. Exploiting the ability of the algorithm to scale to high-dimensional data, we propose an online anomaly detection method to identify anomalies in a collection of multivariate time series. We demonstrate the applicability of the proposed Large Deviations Anomaly Detection (LAD) algorithm in identifying counties in the United States with anomalous trends in terms of COVID-19 related cases and deaths. Several of the identified anomalous counties correlate with counties with documented poor response to the COVID pandemic.
Original language | English |
---|---|
Title of host publication | Computational Science - ICCS 2022, 22nd International Conference, Proceedings |
Editors | Derek Groen, Clélia de Mulatier, Valeria V. Krzhizhanovskaya, Peter M.A. Sloot, Maciej Paszynski, Jack J. Dongarra |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 133-149 |
Number of pages | 17 |
ISBN (Print) | 9783031087509 |
DOIs | |
State | Published - 2022 |
Event | 22nd Annual International Conference on Computational Science, ICCS 2022 - London, United Kingdom Duration: Jun 21 2022 → Jun 23 2022 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13350 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 22nd Annual International Conference on Computational Science, ICCS 2022 |
---|---|
Country/Territory | United Kingdom |
City | London |
Period | 06/21/22 → 06/23/22 |
Funding
Acknowledgements. The authors would like to acknowledge University at Buffalo Center for Computational Research for computing resources and financial support of the National Science Foundation Grant numbers NSF/OAC 1339765 and NSF/DMS 1621853.
Keywords
- Anomaly detection
- High-dimensional data
- Large deviations
- Multivariate time series
- Time series database