BioADAPT-MRC: adversarial learning-based domain adaptation improves biomedical machine reading comprehension task

Maria Mahbub, Sudarshan Srinivasan, Edmon Begoli, Gregory D. Peterson

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Motivation: Biomedical machine reading comprehension (biomedical-MRC) aims to comprehend complex biomedical narratives and assist healthcare professionals in retrieving information from them. The high performance of modern neural network-based MRC systems depends on high-quality, large-scale, human-annotated training datasets. In the biomedical domain, a crucial challenge in creating such datasets is the requirement for domain knowledge, inducing the scarcity of labeled data and the need for transfer learning from the labeled general-purpose (source) domain to the biomedical (target) domain. However, there is a discrepancy in marginal distributions between the general-purpose and biomedical domains due to the variances in topics. Therefore, direct-transferring of learned representations from a model trained on a general-purpose domain to the biomedical domain can hurt the model's performance. Results: We present an adversarial learning-based domain adaptation framework for the biomedical machine reading comprehension task (BioADAPT-MRC), a neural network-based method to address the discrepancies in the marginal distributions between the general and biomedical domain datasets. BioADAPT-MRC relaxes the need for generating pseudo labels for training a well-performing biomedical-MRC model. We extensively evaluate the performance of BioADAPT-MRC by comparing it with the best existing methods on three widely used benchmark biomedical-MRC datasets - BioASQ-7b, BioASQ-8b and BioASQ-9b. Our results suggest that without using any synthetic or human-annotated data from the biomedical domain, BioADAPT-MRC can achieve state-of-the-art performance on these datasets.

Original languageEnglish
Pages (from-to)4369-4379
Number of pages11
JournalBioinformatics
Volume38
Issue number18
DOIs
StatePublished - Sep 15 2022

Funding

This work was supported by Department of Veterans Affairs, VHA Office of Mental Health and Suicide Prevention. This work has been authored by UTBattelle, LLC under Contract No. [DE- AC05-00OR22725] with the US Department of Energy. This research used resources of the Knowledge Discovery Infrastructure at the Oak Ridge National Laboratory, which is supported by the Office of Science of the US Department of Energy under Contract No. [DE-AC05- 00OR22725]; and the Department of Veterans Affairs Office of Information Technology Inter-Agency Agreement with the Department of Energy under IAA No. [VA118-16-M-1062].

Fingerprint

Dive into the research topics of 'BioADAPT-MRC: adversarial learning-based domain adaptation improves biomedical machine reading comprehension task'. Together they form a unique fingerprint.

Cite this