Abstract
Research in the biomedical domain is constantly challenged by its large amount of ever-evolving textual information. Biomedical researchers are usually required to conduct a literature review before any medical intervention to assess the effectiveness of the concerned research. However, the process is time-consuming, and therefore, automation to some extent would help reduce the accompanying information overload. Multi-document summarization of scientific articles for literature reviews is one approximation of such automation. Here in this paper, we describe our pipelined approach for the aforementioned task. We design a BERT-based extractive method followed by a BigBird PEGASUS-based abstractive pipeline for generating literature review summaries from the abstracts of biomedical trial reports as part of the Multi-document Summarization for Literature Review (MSLR) shared task1 in the Scholarly Document Processing (SDP) workshop 20222. Our proposed model achieves the best performance on the MSLR-Cochrane leaderboard3 on majority of the evaluation metrics. Human scrutiny of our automatically generated summaries indicates that our approach is promising to yield readable multi-article summaries for conducting such literature reviews.
| Original language | English |
|---|---|
| Pages (from-to) | 204-209 |
| Number of pages | 6 |
| Journal | Proceedings - International Conference on Computational Linguistics, COLING |
| Volume | 29 |
| Issue number | 9 |
| State | Published - 2022 |
| Externally published | Yes |
| Event | 3rd Workshop on Scholarly Document Processing, SDP 2022 at 29th International Conference on Computational Linguistics, COLING 2022 - Gyeongju, Korea, Republic of Duration: Oct 12 2022 → Oct 17 2022 |
Fingerprint
Dive into the research topics of 'An Extractive-Abstractive Approach for Multi-document Summarization of Scientific Articles for Literature Review'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver