TY - GEN
T1 - Automated Assessment of Encouragement and Warmth in Classrooms Leveraging Multimodal Emotional Features and ChatGPT
AU - Hou, Ruikun
AU - Fütterer, Tim
AU - Bühler, Babette
AU - Bozkir, Efe
AU - Gerjets, Peter
AU - Trautwein, Ulrich
AU - Kasneci, Enkelejda
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - Classroom observation protocols standardize the assessment of teaching effectiveness and facilitate comprehension of classroom interactions. Whereas these protocols offer teachers specific feedback on their teaching practices, the manual coding by human raters is resource-intensive and often unreliable. This has sparked interest in developing AI-driven, cost-effective methods for automating such holistic coding. Our work explores a multimodal approach to automatically estimating encouragement and warmth in classrooms, a key component of the Global Teaching Insights (GTI) study’s observation protocol. To this end, we employed facial and speech emotion recognition with sentiment analysis to extract interpretable features from video, audio, and transcript data. The prediction task involved both classification and regression methods. Additionally, in light of recent large language models’ remarkable text annotation capabilities, we evaluated ChatGPT’s zero-shot performance on this scoring task based on transcripts. We demonstrated our approach on the GTI dataset, comprising 367 16-min video segments from 92 authentic lesson recordings. The inferences of GPT-4 and the best-trained model yielded correlations of r=.341 and r=.441 with human ratings, respectively. Combining estimates from both models through averaging, an ensemble approach achieved a correlation of r=.513, comparable to human inter-rater reliability. Our model explanation analysis indicated that text sentiment features were the primary contributors to the trained model’s decisions. Moreover, GPT-4 could deliver logical and concrete reasoning as potential teacher guidelines. Our findings provide insights into using multimodal techniques for automated classroom observation, aiming to foster teacher training through frequent and valuable feedback.
AB - Classroom observation protocols standardize the assessment of teaching effectiveness and facilitate comprehension of classroom interactions. Whereas these protocols offer teachers specific feedback on their teaching practices, the manual coding by human raters is resource-intensive and often unreliable. This has sparked interest in developing AI-driven, cost-effective methods for automating such holistic coding. Our work explores a multimodal approach to automatically estimating encouragement and warmth in classrooms, a key component of the Global Teaching Insights (GTI) study’s observation protocol. To this end, we employed facial and speech emotion recognition with sentiment analysis to extract interpretable features from video, audio, and transcript data. The prediction task involved both classification and regression methods. Additionally, in light of recent large language models’ remarkable text annotation capabilities, we evaluated ChatGPT’s zero-shot performance on this scoring task based on transcripts. We demonstrated our approach on the GTI dataset, comprising 367 16-min video segments from 92 authentic lesson recordings. The inferences of GPT-4 and the best-trained model yielded correlations of r=.341 and r=.441 with human ratings, respectively. Combining estimates from both models through averaging, an ensemble approach achieved a correlation of r=.513, comparable to human inter-rater reliability. Our model explanation analysis indicated that text sentiment features were the primary contributors to the trained model’s decisions. Moreover, GPT-4 could deliver logical and concrete reasoning as potential teacher guidelines. Our findings provide insights into using multimodal techniques for automated classroom observation, aiming to foster teacher training through frequent and valuable feedback.
KW - AI in Education
KW - ChatGPT zero-shot annotation
KW - Classroom observation
KW - Multimodal machine learning
KW - Teaching effectiveness
UR - http://www.scopus.com/inward/record.url?scp=85200219030&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-64302-6_5
DO - 10.1007/978-3-031-64302-6_5
M3 - Conference contribution
AN - SCOPUS:85200219030
SN - 9783031643019
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 60
EP - 74
BT - Artificial Intelligence in Education - 25th International Conference, AIED 2024, Proceedings
A2 - Olney, Andrew M.
A2 - Chounta, Irene-Angelica
A2 - Liu, Zitao
A2 - Santos, Olga C.
A2 - Bittencourt, Ig Ibert
PB - Springer Science and Business Media Deutschland GmbH
T2 - 25th International Conference on Artificial Intelligence in Education, AIED 2024
Y2 - 8 July 2024 through 12 July 2024
ER -