TY - GEN
T1 - Bioanalog acoustic emotion recognition by genetic feature generation based on low-level-descriptors
AU - Schuller, Björn
AU - Arsić, Dejan
AU - Wallhoff, Frank
AU - Lang, Manfred
AU - Rigoll, Gerhard
PY - 2005
Y1 - 2005
N2 - Affective Computing has grown an important field in today's man-machine-interaction, and the acoustic speech signal is very popular as basis for an automatic classification at the moment. However, recognition performances reported today are mostly not sufficient for a real usage within working systems. Therefore we want to improve on this challenge by evolutionary programming. As a starting point we use prosodic, voice quality and articulatory feature contours. We next propose systematic derivation of function als by means of descriptive statistics. In order to analyze cross-feature information and feature permutations we use Genetic Algorithms, as a complete coverage of possible alterations Is NP-hard. The final attribute set is at the same time optimized by reduction to the most relevant information in order to reduce complexity for the classifier and ensure real-time capability during extraction process. Classification Is fulfilled by diverse machine learning methods for utmost discrimination power. We decided for two public databases, namely the Berlin Emotional Speech Database, and the Danish Emotional Speech Corpus for test-runs. These clearly show the high effectiveness of the suggested approach.
AB - Affective Computing has grown an important field in today's man-machine-interaction, and the acoustic speech signal is very popular as basis for an automatic classification at the moment. However, recognition performances reported today are mostly not sufficient for a real usage within working systems. Therefore we want to improve on this challenge by evolutionary programming. As a starting point we use prosodic, voice quality and articulatory feature contours. We next propose systematic derivation of function als by means of descriptive statistics. In order to analyze cross-feature information and feature permutations we use Genetic Algorithms, as a complete coverage of possible alterations Is NP-hard. The final attribute set is at the same time optimized by reduction to the most relevant information in order to reduce complexity for the classifier and ensure real-time capability during extraction process. Classification Is fulfilled by diverse machine learning methods for utmost discrimination power. We decided for two public databases, namely the Berlin Emotional Speech Database, and the Danish Emotional Speech Corpus for test-runs. These clearly show the high effectiveness of the suggested approach.
KW - Affective computing
KW - Emotion recognition
KW - Genetic feature generation
KW - Speech processing
UR - http://www.scopus.com/inward/record.url?scp=33947309757&partnerID=8YFLogxK
U2 - 10.1109/eurcon.2005.1630194
DO - 10.1109/eurcon.2005.1630194
M3 - Conference contribution
AN - SCOPUS:33947309757
SN - 142440049X
SN - 9781424400492
T3 - EUROCON 2005 - The International Conference on Computer as a Tool
SP - 1292
EP - 1295
BT - EUROCON 2005 - The International Conference on Computer as a Tool
PB - IEEE Computer Society
T2 - EUROCON 2005 - The International Conference on Computer as a Tool
Y2 - 21 November 2005 through 24 November 2005
ER -