Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

Jan Philip Wahle, Nischal Ashok, Terry Ruas, Norman Meuschke, Tirthankar Ghosal, Bela Gipp

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

13 Scopus citations

Abstract

A drastic rise in potentially life-threatening misinformation has been a by-product of the COVID-19 pandemic. Computational support to identify false information within the massive body of data on the topic is crucial to prevent harm. Researchers proposed many methods for flagging online misinformation related to COVID-19. However, these methods predominantly target specific content types (e.g., news) or platforms (e.g., Twitter). The methods’ capabilities to generalize were largely unclear so far. We evaluate fifteen Transformer-based models on five COVID-19 misinformation datasets that include social media posts, news articles, and scientific papers to fill this gap. We show tokenizers and models tailored to COVID-19 data do not provide a significant advantage over general-purpose ones. Our study provides a realistic assessment of models for detecting COVID-19 misinformation. We expect that evaluating a broad spectrum of datasets and models will benefit future research in developing misinformation detection systems.

Original languageEnglish
Title of host publicationInformation for a Better World
Subtitle of host publicationShaping the Global Future - 17th International Conference, iConference 2022, Proceedings
EditorsMalte Smits
PublisherSpringer Science and Business Media Deutschland GmbH
Pages381-392
Number of pages12
ISBN (Print)9783030969561
DOIs
StatePublished - 2022
Event17th International Conference on Information for a Better World: Shaping the Global Future, iConference 2022 - Virtual, Online
Duration: Feb 28 2022Mar 4 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13192 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th International Conference on Information for a Better World: Shaping the Global Future, iConference 2022
CityVirtual, Online
Period02/28/2203/4/22

Keywords

  • COVID-19
  • Health
  • Social media
  • Transformers

Fingerprint

Dive into the research topics of 'Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection'. Together they form a unique fingerprint.

Cite this