Abstract
A significant portion of the working population has their mainstream interaction virtually these days. Meetings are being organized and recorded daily in volumes likely exceeding what can be ever comprehended. With the deluge of meetings, it is important to identify and jot down the essential items discussed in the meeting, usually referred to as the minutes. The task of minuting is diverse and depends on the goals, style, procedure, and category of the meeting. Automatic Minuting is close to summarization; however, not exactly the same. In this work, we evaluate the current state-of-the-art summarization models for automatically generating meeting minutes. We provide empirical baselines to motivate the community to work on this very timely, relevant yet challenging problem. We conclude that off-the-shelf text summarization models are not the best candidates for generating minutes which calls for further research on meeting-specific summarization or minuting models. We found that Transformer-based models perform comparatively better than other categories of summarization algorithms; however, they are still far from generating a good multi-party meeting summary/minutes. We release our experimental code at https://github.com/ELITR/ Minuting_Baseline_Experiments.
| Original language | English |
|---|---|
| Pages | 50-60 |
| Number of pages | 11 |
| State | Published - 2021 |
| Externally published | Yes |
| Event | 35th Pacific Asia Conference on Language, Information and Computation, PACLIC 2021 - Shanghai, China Duration: Nov 5 2021 → Nov 7 2021 |
Conference
| Conference | 35th Pacific Asia Conference on Language, Information and Computation, PACLIC 2021 |
|---|---|
| Country/Territory | China |
| City | Shanghai |
| Period | 11/5/21 → 11/7/21 |
Funding
This work has received funding from the European Union’s Horizon 2020 Research and Innovation Pro-gramme under Grant Agreement No 825460 (ELITR) and the grant 19-26934X (NEUREM3) of the Czech Science Foundation.