TY - GEN
T1 - GMM-UBM based open-set online speaker diarization
AU - Geiger, Jürgen
AU - Wallhoff, Frank
AU - Rigoll, Gerhard
PY - 2010
Y1 - 2010
N2 - In this paper, we present an open-set online speaker diarization system. The system is based on Gaussian mixture models (GMMs), which are used as speaker models. The system starts with just 3 such models (one each for both genders and one for non-speech) and creates models for individual speakers not till the speakers occur. As more and more speakers appear, more models are created. Our system implicitly performs audio segmentation, speech/non-speech classification, gender recognition and speaker identification. The system is tested with the HUB4-1996 radio broadcast news database.
AB - In this paper, we present an open-set online speaker diarization system. The system is based on Gaussian mixture models (GMMs), which are used as speaker models. The system starts with just 3 such models (one each for both genders and one for non-speech) and creates models for individual speakers not till the speakers occur. As more and more speakers appear, more models are created. Our system implicitly performs audio segmentation, speech/non-speech classification, gender recognition and speaker identification. The system is tested with the HUB4-1996 radio broadcast news database.
KW - Gaussian mixture models
KW - Open-set speaker recognition
KW - Speaker diarization
UR - http://www.scopus.com/inward/record.url?scp=79959841110&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:79959841110
T3 - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
SP - 2330
EP - 2333
BT - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
PB - International Speech Communication Association
ER -