Cooperative learning and its application to emotion recognition from speech

Zixing Zhang, Eduardo Coutinho, Jun Deng, Björn Schuller

Research output: Contribution to journalArticlepeer-review

102 Scopus citations

Abstract

In this paper, we propose a novel method for highly efficient exploitation of unlabeled data-Cooperative Learning. Our approach consists of combining Active Learning and Semi-Supervised Learning techniques, with the aim of reducing the costly effects of human annotation. The core underlying idea of Cooperative Learning is to share the labeling work between human and machine efficiently in such a way that instances predicted with insufficient confidence value are subject to human labeling, and those with high confidence values are machine labeled. We conducted various test runs on two emotion recognition tasks with a variable number of initial supervised training instances and two different feature sets. The results show that Cooperative Learning consistently outperforms individual Active and Semi-Supervised Learning techniques in all test cases. In particular, we show that our method based on the combination of Active Learning and Co-Training leads to the same performance of a model trained on the whole training set, but using 75% fewer labeled instances. Therefore, our method efficiently and robustly reduces the need for human annotations.

Original languageEnglish
Pages (from-to)115-126
Number of pages12
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume23
Issue number1
DOIs
StatePublished - 1 Jan 2015

Keywords

  • Acoustics
  • active learning
  • cooperative learning
  • emotion recognition
  • multi-view learning
  • semi-supervised learning
  • supervised learning

Fingerprint

Dive into the research topics of 'Cooperative learning and its application to emotion recognition from speech'. Together they form a unique fingerprint.

Cite this