The voice of leadership: Models and performances of automatic analysis in online speeches

Felix Weninger, Jarek Krajewski, Anton Batliner, Björn Schuller

Research output: Contribution to journal › Article › peer-review

35 Scopus citations


We introduce the automatic determination of leadership emergence by acoustic and linguistic features in online speeches. Full realism is provided by the varying and challenging acoustic conditions of the presented YouTube corpus of speeches available online, labeled by 10 raters, and by processing that includes Long Short-Term Memory-based robust voice activity detection (VAD) and automatic speech recognition (ASR) prior to feature extraction. We discuss cluster-preserving scaling of 10 original dimensions for discrete and continuous task modeling, ground truth establishment, and appropriate feature extraction for this novel speaker trait analysis paradigm. In extensive classification and regression runs, different temporal chunkings and optimal late fusion strategies (LFSs) of feature streams are presented. As a result, achievers, charismatic speakers, and team players can be recognized significantly above chance level, reaching up to 72.5 percent accuracy on unseen test data.

Original language: English
Article number: 6212433
Pages (from-to): 496-508
Number of pages: 13
Journal: IEEE Transactions on Affective Computing
Issue number: 4
State: Published - 2012


  • Acoustic/linguistic fusion
  • Dimensional analysis
  • Personality analysis

