TY - GEN
T1 - Automatic Speech-Based Charisma Recognition and the Impact of Integrating Auxiliary Characteristics
AU - Kathan, Alexander
AU - Amiriparian, Shahin
AU - Christ, Lukas
AU - Eulitz, Simone
AU - Schuller, Björn W.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Automatic recognition of speaker's states and traits is crucial to facilitate a more naturalistic human-AI interaction - a key focus in human-computer interaction to enhance user experience. One particularly important trait in daily life is charisma. To date, its definition is still controversial. However, it seems that there are characteristics in speech that the majority perceives as charismatic. To this end, we address the novel speech-based task of charisma recognition in a three-fold approach. First, we predict charismatic speech using both interpretable acoustic features and embeddings of two audio Transformers. Afterwards, we make use of auxiliary labels that are highly correlated with charisma, including enthusiastic, likeable, attractive, warm, and leader-like, to check their impact on charisma recognition. Finally, we personalise the best model, taking individual speech characteristics into account. In our experiments, we demonstrate that the charisma prediction model benefits from integrating auxiliary characteristics as well as from the personalised approach, resulting in a best Pearson's correlation coefficient of 0.4304.
AB - Automatic recognition of speaker's states and traits is crucial to facilitate a more naturalistic human-AI interaction - a key focus in human-computer interaction to enhance user experience. One particularly important trait in daily life is charisma. To date, its definition is still controversial. However, it seems that there are characteristics in speech that the majority perceives as charismatic. To this end, we address the novel speech-based task of charisma recognition in a three-fold approach. First, we predict charismatic speech using both interpretable acoustic features and embeddings of two audio Transformers. Afterwards, we make use of auxiliary labels that are highly correlated with charisma, including enthusiastic, likeable, attractive, warm, and leader-like, to check their impact on charisma recognition. Finally, we personalise the best model, taking individual speech characteristics into account. In our experiments, we demonstrate that the charisma prediction model benefits from integrating auxiliary characteristics as well as from the personalised approach, resulting in a best Pearson's correlation coefficient of 0.4304.
UR - https://www.scopus.com/pages/publications/85217878419
U2 - 10.1109/Telepresence63209.2024.10841640
DO - 10.1109/Telepresence63209.2024.10841640
M3 - Conference contribution
AN - SCOPUS:85217878419
T3 - 2024 IEEE Conference on Telepresence, Telepresence 2024
SP - 148
EP - 153
BT - 2024 IEEE Conference on Telepresence, Telepresence 2024
A2 - Adascalitei, Adrian
A2 - Stoica, Adrian
A2 - Mangini, Agostino
A2 - Mohammadi, Alireza
A2 - Yan, Charlie
A2 - Nemeth, Christopher
A2 - Kaber, David
A2 - Dall'Alba, Diego
A2 - Tunstel, Edward
A2 - Schena, Emiliano
A2 - Atashzar, Farokh
A2 - Sahin, Ferat
A2 - Barresi, Giacinto
A2 - Ren, Hongliang
A2 - van Erp, Jan
A2 - Wu, Jie Ying
A2 - Ymjin, Jin
A2 - Trajkovic, Ljiljana
A2 - Aly, Mohamed
A2 - Deshpande, Nikhil
A2 - Fiorini, Paolo
A2 - Carli, Rahaele
A2 - Nahavandi, Saeid
A2 - Livatino, Salvatore
A2 - Ryu, Seok-Chang
A2 - Falk, Tiago H.
A2 - Gedeon, Tom
A2 - He, Yutao
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 1st IEEE Conference on Telepresence, Telepresence 2024
Y2 - 16 November 2024 through 17 November 2024
ER -