TY - GEN
T1 - Towards automatic intoxication detection from speech in real-life acoustic environments
AU - Zhang, Zixing
AU - Weninger, Felix
AU - Schuller, Björn
N1 - Publisher Copyright:
© VDE VERLAG GMBH • Berlin • Offenbach.
PY - 2012
Y1 - 2012
N2 - In-car intoxication detection from speech is a highly promising non-intrusive method to reduce the accident risk associated with drunk driving. However, in-car noise significantly influences the recognition performance and needs to be addressed in practical applications. In this paper, we investigate how seriously the intrinsic in-car noise and background music affect the accuracy of intoxication recognition. In extensive test runs using the official speech corpus of the INTERSPEECH 2011 Intoxication Challenge, realistic car noise and original popular music we conclude that stationary driving noise as well as music introduce a significant downgrade when acoustic models are trained on clean speech only, which can partly be alleviated by multi-condition training. Besides, exploiting cumulative evidence over time by late decision fusion appears to be a promising way to further enhance performance in noisy conditions.
AB - In-car intoxication detection from speech is a highly promising non-intrusive method to reduce the accident risk associated with drunk driving. However, in-car noise significantly influences the recognition performance and needs to be addressed in practical applications. In this paper, we investigate how seriously the intrinsic in-car noise and background music affect the accuracy of intoxication recognition. In extensive test runs using the official speech corpus of the INTERSPEECH 2011 Intoxication Challenge, realistic car noise and original popular music we conclude that stationary driving noise as well as music introduce a significant downgrade when acoustic models are trained on clean speech only, which can partly be alleviated by multi-condition training. Besides, exploiting cumulative evidence over time by late decision fusion appears to be a promising way to further enhance performance in noisy conditions.
UR - http://www.scopus.com/inward/record.url?scp=84947969272&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84947969272
T3 - Proceedings of 10th ITG Symposium on Speech Communication
BT - Proceedings of 10th ITG Symposium on Speech Communication
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th ITG Symposium on Speech Communication, ITGspeech 2012
Y2 - 26 September 2012 through 28 September 2012
ER -