Multimodal representation learning for predicting molecule–disease relations

Jun Wen, Xiang Zhang, Everett Rush, Vidul A. Panickan, Xingyu Li, Tianrun Cai, Doudou Zhou, Yuk Lam Ho, Lauren Costa, Edmon Begoli, Chuan Hong, J. Michael Gaziano, Kelly Cho, Junwei Lu, Katherine P. Liao, Marinka Zitnik, Tianxi Cai

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Motivation: Predicting molecule–disease indications and side effects is important for drug development and pharmacovigilance. Comprehensively mining molecule–molecule, molecule–disease and disease–disease semantic dependencies can potentially improve prediction performance. Methods: We introduce a Multi-Modal REpresentation Mapping Approach to Predicting molecular-disease relations (M2REMAP) by incorporating clinical semantics learned from electronic health records (EHR) of 12.6 million patients. Specifically, M2REMAP first learns a multimodal molecule representation that synthesizes chemical property and clinical semantic information by mapping molecule chemicals via a deep neural network onto the clinical semantic embedding space shared by drugs, diseases and other common clinical concepts. To infer molecule–disease relations, M2REMAP combines multimodal molecule representation and disease semantic embedding to jointly infer indications and side effects. Results: We extensively evaluate M2REMAP on molecule indications, side effects and interactions. Results show that incorporating EHR embeddings improves performance significantly, for example, attaining an improvement over the baseline models by 23.6% in PRC-AUC on indications and 23.9% on side effects. Further, M2REMAP overcomes the limitation of existing methods and effectively predicts drugs for novel diseases and emerging pathogens.

Original languageEnglish
Article numberbtad085
JournalBioinformatics
Volume39
Issue number2
DOIs
StatePublished - Feb 1 2023

Funding

Part of this research is based on data from the Million Veteran Program, Office of Research and Development, Veterans Health Administration and was supported by award [#MVP000]. KPL is supported by the NIH P30 AR072577. This publication does not represent the views of the Department of Veteran Affairs or the United States Government.

Fingerprint

Dive into the research topics of 'Multimodal representation learning for predicting molecule–disease relations'. Together they form a unique fingerprint.

Cite this