Towards an LSTM-based approach for detection of temporally anomalous data in medical datasets

Michael Bowie, Edmon Begoli, Byung Park, Jeevith Bopaiah

Research output: Contribution to conferencePaperpeer-review

2 Scopus citations

Abstract

Health-related data is complex, heterogeneous, and frequently temporally discontinuous which makes it difficult to analyze, and even more difficult to assess for quality, detect errors, or address anomalies. Undetected errors and anomalies within medical data, if not properly addressed, can have far reaching humanitarian and/or financial consequences. Many machine learning methodologies applied towards automatic detection of complex errors or anomalies are well known. However, the additional challenge in detection of anomalies in health care data is the so-called temporal or contextual dependency – i.e., the challenge of distinguishing "episodes" of anomalous data that are anomalous either in the context of a particular time window or specific to some particular medical condition. In this paper, we present research in progress on the application to medical data of state-of-the-art anomaly detection techniques based on Long-Short-Term-Memory (LSTM) neural networks. We hypothesize that LSTMs present a robust and flexible approach to the temporal anomaly detection in medical data sets due in part to their ability to retain information about both long-term and short-term dependencies on the model outcome.

Original languageEnglish
StatePublished - 2017
Event22nd MIT International Conference on Information Quality, ICIQ 2017 - Little Rock, United States
Duration: Oct 6 2017Oct 7 2017

Conference

Conference22nd MIT International Conference on Information Quality, ICIQ 2017
Country/TerritoryUnited States
CityLittle Rock
Period10/6/1710/7/17

Funding

The research presented in this paper was supported in part by the joint program MVP CHAMPION between the Department of Veterans Affairs, Veterans Health Administration, Office of Research and Development and the Department of Energy. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). The research presented in this paper was supported in part by the joint program MVP CHAMPION between the Department of Veterans Affairs, Veterans Health Administration, Office of Research and Development and the Department of Energy.

FundersFunder number
DOE Public Access Plan
LLC
UT-Battelle
United States Government
Veterans Health Administration
U.S. Department of Energy
U.S. Department of Veterans Affairs
Office of Research and Development
Health Services Research and Development
Office of Energy Research and Development

    Keywords

    • Anomaly detection
    • LSTM
    • Long-short-term-memory network
    • Medical data
    • Time series

    Fingerprint

    Dive into the research topics of 'Towards an LSTM-based approach for detection of temporally anomalous data in medical datasets'. Together they form a unique fingerprint.

    Cite this