TY - JOUR
T1 - NLP-based feature extraction for the detection of COVID-19 misinformation videos on YouTube
AU - Medina Serrano, Juan Carlos
AU - Papakyriakopoulos, Orestis
AU - Hegelich, Simon
N1 - Publisher Copyright:
© ACL 2020.All right reserved.
PY - 2020
Y1 - 2020
N2 - We present a simple NLP methodology for detecting COVID-19 misinformation videos on YouTube by leveraging user comments. We use transfer learning pre-trained models to generate a multi-label classifier that can categorize conspiratorial content. We use the percentage of misinformation comments on each video as a new feature for video classification. We show that the inclusion of this feature in simple models yields an accuracy of up to 82.2%. Furthermore, we verify the significance of the feature by performing a Bayesian analysis. Finally, we show that adding the first hundred comments as tf-idf features increases the video classifier accuracy by up to 89.4%.
AB - We present a simple NLP methodology for detecting COVID-19 misinformation videos on YouTube by leveraging user comments. We use transfer learning pre-trained models to generate a multi-label classifier that can categorize conspiratorial content. We use the percentage of misinformation comments on each video as a new feature for video classification. We show that the inclusion of this feature in simple models yields an accuracy of up to 82.2%. Furthermore, we verify the significance of the feature by performing a Bayesian analysis. Finally, we show that adding the first hundred comments as tf-idf features increases the video classifier accuracy by up to 89.4%.
UR - http://www.scopus.com/inward/record.url?scp=85149144271&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85149144271
SN - 0736-587X
JO - Proceedings of the Annual Meeting of the Association for Computational Linguistics
JF - Proceedings of the Annual Meeting of the Association for Computational Linguistics
T2 - 1st Workshop on NLP for COVID-19 at the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020
Y2 - 1 July 2020
ER -