An Estimation of Online Video User Engagement From Features of Time- and Value-Continuous, Dimensional Emotions

Lukas Stappen, Alice Baird, Michelle Lienhart, Annalena Bätz, Björn Schuller

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Portraying emotion and trustworthiness is known to increase the appeal of video content. However, the causal relationship between these signals and online user engagement is not well understood. This limited understanding is partly due to a scarcity in emotionally annotated data and the varied modalities which express user engagement online. In this contribution, we utilize a large dataset of YouTube review videos which includes ca. 600 h of dimensional arousal, valence and trustworthiness annotations. We investigate features extracted from these signals against various user engagement indicators including views, like/dislike ratio, as well as the sentiment of comments. In doing so, we identify the positive and negative influences which single features have, as well as interpretable patterns in each dimension which relate to user engagement. Our results demonstrate that smaller boundary ranges and fluctuations for arousal lead to an increase in user engagement. Furthermore, the extracted time-series features reveal significant (p < 0.05) correlations for each dimension, such as, count below signal mean (arousal), number of peaks (valence), and absolute energy (trustworthiness). From this, an effective combination of features is outlined for approaches aiming to automatically predict several user engagement indicators. In a user engagement prediction paradigm we compare all features against semi-automatic (cross-task), and automatic (task-specific) feature selection methods. These selected feature sets appear to outperform the usage of all features, e.g., using all features achieves 1.55 likes per day (Lp/d) mean absolute error from valence; this improves through semi-automatic and automatic selection to 1.33 and 1.23 Lp/d, respectively (data mean 9.72 Lp/d with a std. 28.75 Lp/d).

Original languageEnglish
Article number773154
JournalFrontiers in Computer Science
Volume4
DOIs
StatePublished - 23 Mar 2022
Externally publishedYes

Keywords

  • YouTube
  • affective computing
  • continuous emotion annotation
  • explainable machine learning
  • popularity of videos
  • user engagement

Fingerprint

Dive into the research topics of 'An Estimation of Online Video User Engagement From Features of Time- and Value-Continuous, Dimensional Emotions'. Together they form a unique fingerprint.

Cite this