TY - GEN
T1 - Ten recent trends in computational paralinguistics
AU - Schuller, Björn
AU - Weninger, Felix
PY - 2012
Y1 - 2012
N2 - The field of computational paralinguistics is currently emerging from loosely connected research on speaker states, traits, and vocal behaviour. Starting from a broad perspective on the state-of-the-art in this field, we combine these facts with a bit of 'tea leaf reading' to identify ten currently dominant trends that might also characterise the next decade of research: taking into account more tasks and task interdependencies, modelling paralinguistic information in the continuous domain, agglomerating and evaluating on large amounts of heterogeneous data, exploiting more and more types of features, fusing linguistic and non-linguistic phenomena, devoting more effort to optimisation of the machine learning aspects, standardising the whole processing chain, addressing robustness and security of systems, proceeding to evaluation in real-life conditions, and finally overcoming cross-language and cross-cultural barriers. We expect that following these trends we will see an increase in the 'social competence' of tomorrow's speech and language processing systems.
AB - The field of computational paralinguistics is currently emerging from loosely connected research on speaker states, traits, and vocal behaviour. Starting from a broad perspective on the state-of-the-art in this field, we combine these facts with a bit of 'tea leaf reading' to identify ten currently dominant trends that might also characterise the next decade of research: taking into account more tasks and task interdependencies, modelling paralinguistic information in the continuous domain, agglomerating and evaluating on large amounts of heterogeneous data, exploiting more and more types of features, fusing linguistic and non-linguistic phenomena, devoting more effort to optimisation of the machine learning aspects, standardising the whole processing chain, addressing robustness and security of systems, proceeding to evaluation in real-life conditions, and finally overcoming cross-language and cross-cultural barriers. We expect that following these trends we will see an increase in the 'social competence' of tomorrow's speech and language processing systems.
KW - Computational paralinguistics
KW - machine learning
KW - speaker classification
KW - speech analysis
UR - http://www.scopus.com/inward/record.url?scp=84870343856&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-34584-5_3
DO - 10.1007/978-3-642-34584-5_3
M3 - Conference contribution
AN - SCOPUS:84870343856
SN - 9783642345838
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 35
EP - 49
BT - Cognitive Behavioural Systems - COST 2102 International Training School, Revised Selected Papers
T2 - International Training School on Cognitive Behavioural Systems, COST 2102
Y2 - 21 February 2011 through 26 February 2011
ER -