Skip to main navigation Skip to search Skip to main content

Towards Multimodal Prediction of Spontaneous Humor: A Novel Dataset and First Results

  • Lukas Christ
  • , Shahin Amiriparian
  • , Alexander Kathan
  • , Niklas Muller
  • , Andreas Konig
  • , Bjorn W. Schuller
  • University Hospital Augsburg
  • Technical University of Munich
  • Universität Passau
  • Imperial College London

Research output: Contribution to journalArticlepeer-review

6 Scopus citations

Abstract

Humor is a substantial element of human social behavior, affect, and cognition. Its automatic understanding can facilitate a more naturalistic human-AI interaction. Current methods of humor detection have been exclusively based on staged data, making them inadequate for ‘real-world’ applications. We contribute to addressing this deficiency by introducing the novel Passau-Spontaneous Football Coach Humor (Passau-SFCH) dataset, comprising about 11 hours of recordings. The Passau-SFCH dataset is annotated for the presence of humor and its dimensions (sentiment and direction) as proposed in Martin's Humor Style Questionnaire. We conduct a series of experiments employing pretrained Transformers, convolutional neural networks, and expert-designed features. The performance of each modality (text, audio, video) for spontaneous humor recognition is analyzed and their complementarity is investigated. Our findings suggest that for the automatic analysis of humor and its sentiment, facial expressions are most promising, while humor direction can be best modeled via text-based features. Further, we experiment with different multimodal approaches to humor recognition, including decision-level fusion and MulT, a multimodal Transformer approach. In this context, we propose a novel multimodal architecture that yields the best overall results.

Original languageEnglish
Pages (from-to)844-860
Number of pages17
JournalIEEE Transactions on Affective Computing
Volume16
Issue number2
DOIs
StatePublished - 2025

Keywords

  • Affective computing
  • computer audition
  • computer vision
  • dataset
  • humor
  • multimedia
  • natural language processing
  • sentiment analysis

Fingerprint

Dive into the research topics of 'Towards Multimodal Prediction of Spontaneous Humor: A Novel Dataset and First Results'. Together they form a unique fingerprint.

Cite this