Skip to main navigation Skip to search Skip to main content

Multimodal Engagement Analysis From Facial Videos in the Classroom

  • Omer Sumer
  • , Patricia Goldberg
  • , Sidney Dmello
  • , Peter Gerjets
  • , Ulrich Trautwein
  • , Enkelejda Kasneci
  • University of Tübingen
  • University of Colorado
  • Knowledge Media Research Center

Research output: Contribution to journalArticlepeer-review

113 Scopus citations

Abstract

Student engagement is a key component of learning and teaching, resulting in a plethora of automated methods to measure it. Whereas most of the literature explores student engagement analysis using computer-based learning often in the lab, we focus on using classroom instruction in authentic learning environments. We collected audiovisual recordings of secondary school classes over a one and a half month period, acquired continuous engagement labeling per student (N=15) in repeated sessions, and explored computer vision methods to classify engagement from facial videos. We learned deep embeddings for attentional and affective features by training Attention-Net for head pose estimation and Affect-Net for facial expression recognition using previously-collected large-scale datasets. We used these representations to train engagement classifiers on our data, in individual and multiple channel settings, considering temporal dependencies. The best performing engagement classifiers achieved student-independent AUCs of.620 and.720 for grades 8 and 12, respectively, with attention-based features outperforming affective features. Score-level fusion either improved the engagement classifiers or was on par with the best performing modality. We also investigated the effect of personalization and found that only 60 seconds of person-specific data, selected by margin uncertainty of the base classifier, yielded an average AUC improvement of.084.

Original languageEnglish
Pages (from-to)1012-1027
Number of pages16
JournalIEEE Transactions on Affective Computing
Volume14
Issue number2
DOIs
StatePublished - 1 Apr 2023
Externally publishedYes

Keywords

  • Affective computing
  • computer vision
  • educational technology
  • nonverbal behaviour understanding

Fingerprint

Dive into the research topics of 'Multimodal Engagement Analysis From Facial Videos in the Classroom'. Together they form a unique fingerprint.

Cite this