Abstract
This paper presents an application of topic modeling on event sequences of Electronic Health Record (EHR) orders. Metaphorically, we approach clinical order event sequences of unlabeled data as if they are documents where words are the events that occurred in the history of an order. We demonstrate the approach leveraging Consult orders data. The details of the data preprocessing and the data structures are described along with the data sources. Latent Dirichlet Allocation (LDA) is leveraged to fit against the limited datasets prepared. Another open source tool - LDAvis is used for exploratory analysis of the LDA results. The preliminary results revealed some order patterns that are qualitatively evaluated as potential irregular transitions. The goal of this analysis is to provide unsupervised learning application to domain experts in the absence of labeled data where they can investigate captured patterns and identify irregular transitions of orders. Ultimately, such efforts will guide formalization of hazard detection algorithms that monitor EHR data to identify health information technology related hazards.
Original language | English |
---|---|
Title of host publication | 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781728108483 |
DOIs | |
State | Published - May 2019 |
Event | 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019 - Chicago, United States Duration: May 19 2019 → May 22 2019 |
Publication series
Name | 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019 - Proceedings |
---|
Conference
Conference | 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019 |
---|---|
Country/Territory | United States |
City | Chicago |
Period | 05/19/19 → 05/22/19 |
Funding
This work is sponsored by the US Department of Veterans Affairs. This manuscript has been authored by UT-Battelle, LLC, under contract DE-AC05-00OR22725 with the US Department of Energy (DOE). The US government retains and the publisher, by accepting the article for publication, acknowledges that the US government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this manuscript, or allow others to do so, for US government purposes. DOE will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan).
Keywords
- Computerized provider records systems
- Electronic health records
- Hazard detection
- Health information technology
- Latent Dirichlet Allocation
- Topic modeling
- Unsupervised learning