NLP-based feature extraction for the detection of COVID-19 misinformation videos on YouTube

Publikation: Beitrag in FachzeitschriftKonferenzartikelBegutachtung

54 Zitate (Scopus)

Abstract

We present a simple NLP methodology for detecting COVID-19 misinformation videos on YouTube by leveraging user comments. We use transfer learning pre-trained models to generate a multi-label classifier that can categorize conspiratorial content. We use the percentage of misinformation comments on each video as a new feature for video classification. We show that the inclusion of this feature in simple models yields an accuracy of up to 82.2%. Furthermore, we verify the significance of the feature by performing a Bayesian analysis. Finally, we show that adding the first hundred comments as tf-idf features increases the video classifier accuracy by up to 89.4%.

OriginalspracheEnglisch
FachzeitschriftProceedings of the Annual Meeting of the Association for Computational Linguistics
PublikationsstatusVeröffentlicht - 2020
Veranstaltung1st Workshop on NLP for COVID-19 at the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020 - Virtual, Online, USA/Vereinigte Staaten
Dauer: 1 Juli 2020 → …

Fingerprint

Untersuchen Sie die Forschungsthemen von „NLP-based feature extraction for the detection of COVID-19 misinformation videos on YouTube“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren