Understanding and Interpreting the Impact of User Context in Hate Speech Detection

Edoardo Mosca, Maximilian Wich, Georg Groh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

27 Scopus citations

Abstract

As hate speech spreads on social media and online communities, research continues to work on its automatic detection. Recently, recognition performance has been increasing thanks to advances in deep learning and the integration of user features. This work investigates the effects that such features can have on a detection model. Unlike previous research, we show that simple performance comparison does not expose the full impact of including contextualand user information. By leveraging explainability techniques, we show (1) that user features play a role in the model's decision and (2) how they affect the feature space learned by the model. Besides revealing that-and also illustrating why-user features are the reason for performance gains, we show how such techniques can be combined to better understand the model and to detect unintended bias.

Original languageEnglish
Title of host publicationSocialNLP 2021 - 9th International Workshop on Natural Language Processing for Social Media, Proceedings of the Workshop
EditorsLun-Wei Ku, Cheng-Te Li
PublisherAssociation for Computational Linguistics (ACL)
Pages91-102
Number of pages12
ISBN (Electronic)9781954085329
StatePublished - 2021
Event9th International Workshop on Natural Language Processing for Social Media, SocialNLP 2021 - Virtual, Online
Duration: 10 Jun 2021 → …

Publication series

NameSocialNLP 2021 - 9th International Workshop on Natural Language Processing for Social Media, Proceedings of the Workshop

Conference

Conference9th International Workshop on Natural Language Processing for Social Media, SocialNLP 2021
CityVirtual, Online
Period10/06/21 → …

Fingerprint

Dive into the research topics of 'Understanding and Interpreting the Impact of User Context in Hate Speech Detection'. Together they form a unique fingerprint.

Cite this