Abstract
Time series anomaly detection is frequently used to identify extreme behaviors within a single time series. Identifying extreme trends in relation to a collection of other time series, on the other hand, is frequently of significant interest, such as in public health policy, social justice, and pandemic propagation. Using concepts from large deviations theory, we propose an algorithm that can scale to large collections of time series data. This paper expands on the LAD algorithm presented in Guggilam et al. (2022). The proposed algorithm is an online anomaly detection method for identifying anomalies in a collection of multivariate time series that takes advantage of the algorithm's ability to scale to high-dimensional data. We show how the proposed Large Deviations Anomaly Detection (LAD) algorithm can be used to identify regions with anomalous trends in COVID-19 cases, deaths, biweekly growth rates, vaccinations, and fatality rates. Several of the observed anomalous trends are associated with regions that have demonstrated poor response to the COVID pandemic.
Original language | English |
---|---|
Article number | 102101 |
Journal | Journal of Computational Science |
Volume | 72 |
DOIs | |
State | Published - Sep 2023 |
Funding
The authors would like to acknowledge University at Buffalo Center for Computational Research for computing resources and financial support of the National Science Foundation, United States Grant numbers NSF/OAC 1339765 and NSF/DMS 1621853 .
Funders | Funder number |
---|---|
University at Buffalo Center for Computational Research | |
National Science Foundation | |
Division of Mathematical Sciences | 1621853 |
Ohio Arts Council | 1339765 |
Keywords
- Anomaly detection
- High-dimensional data
- Large deviations
- Multivariate time series
- Time series database