TY - GEN
T1 - Topic modeling to discern irregular order patterns in unlabeled electronic health records
AU - Ozmen, Ozgur
AU - Klasky, Hilda B.
AU - Omitaomu, Olufemi A.
AU - Olama, Mohammed M.
AU - Kuruganti, Teja
AU - Pullum, Laura
AU - Ward, Merry
AU - Scott, Jeanie M.
AU - Laurio, Angela
AU - Nebeker, Jonathan
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/5
Y1 - 2019/5
N2 - This paper presents an application of topic modeling on event sequences of Electronic Health Record (EHR) orders. Metaphorically, we approach clinical order event sequences of unlabeled data as if they are documents where words are the events that occurred in the history of an order. We demonstrate the approach leveraging Consult orders data. The details of the data preprocessing and the data structures are described along with the data sources. Latent Dirichlet Allocation (LDA) is leveraged to fit against the limited datasets prepared. Another open source tool - LDAvis is used for exploratory analysis of the LDA results. The preliminary results revealed some order patterns that are qualitatively evaluated as potential irregular transitions. The goal of this analysis is to provide unsupervised learning application to domain experts in the absence of labeled data where they can investigate captured patterns and identify irregular transitions of orders. Ultimately, such efforts will guide formalization of hazard detection algorithms that monitor EHR data to identify health information technology related hazards.
AB - This paper presents an application of topic modeling on event sequences of Electronic Health Record (EHR) orders. Metaphorically, we approach clinical order event sequences of unlabeled data as if they are documents where words are the events that occurred in the history of an order. We demonstrate the approach leveraging Consult orders data. The details of the data preprocessing and the data structures are described along with the data sources. Latent Dirichlet Allocation (LDA) is leveraged to fit against the limited datasets prepared. Another open source tool - LDAvis is used for exploratory analysis of the LDA results. The preliminary results revealed some order patterns that are qualitatively evaluated as potential irregular transitions. The goal of this analysis is to provide unsupervised learning application to domain experts in the absence of labeled data where they can investigate captured patterns and identify irregular transitions of orders. Ultimately, such efforts will guide formalization of hazard detection algorithms that monitor EHR data to identify health information technology related hazards.
KW - Computerized provider records systems
KW - Electronic health records
KW - Hazard detection
KW - Health information technology
KW - Latent Dirichlet Allocation
KW - Topic modeling
KW - Unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85073012303&partnerID=8YFLogxK
U2 - 10.1109/BHI.2019.8834659
DO - 10.1109/BHI.2019.8834659
M3 - Conference contribution
AN - SCOPUS:85073012303
T3 - 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019 - Proceedings
BT - 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE EMBS International Conference on Biomedical and Health Informatics, BHI 2019
Y2 - 19 May 2019 through 22 May 2019
ER -