TY - GEN
T1 - A real-time speech enhancement framework for multi-party meetings
AU - Rotili, Rudy
AU - Principi, Emanuele
AU - Squartini, Stefano
AU - Schuller, Björn
PY - 2011
Y1 - 2011
N2 - This paper proposes a real-time speech enhancement framework that operates in the presence of multiple sources in reverberant environments. The aim is to automatically reduce the distortions introduced by room reverberation in the available distant speech signals and thus to achieve a significant improvement in speech quality for each speaker. The overall framework is composed of three cooperating blocks, each fulfilling a specific task: speaker diarization, room impulse response identification, and speech dereverberation. In particular, the speaker diarization algorithm is essential to drive the operations performed in the other two stages according to the speakers' activity in the room. Extensive computer simulations have been performed using a subset of the AMI database; the results obtained show the effectiveness of the approach.
AB - This paper proposes a real-time speech enhancement framework that operates in the presence of multiple sources in reverberant environments. The aim is to automatically reduce the distortions introduced by room reverberation in the available distant speech signals and thus to achieve a significant improvement in speech quality for each speaker. The overall framework is composed of three cooperating blocks, each fulfilling a specific task: speaker diarization, room impulse response identification, and speech dereverberation. In particular, the speaker diarization algorithm is essential to drive the operations performed in the other two stages according to the speakers' activity in the room. Extensive computer simulations have been performed using a subset of the AMI database; the results obtained show the effectiveness of the approach.
KW - Blind Channel Identification
KW - Real-time Signal Processing
KW - Speaker Diarization
KW - Speech Dereverberation
KW - Speech Enhancement
UR - http://www.scopus.com/inward/record.url?scp=81155133357&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-25020-0_11
DO - 10.1007/978-3-642-25020-0_11
M3 - Conference contribution
AN - SCOPUS:81155133357
SN - 9783642250194
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 80
EP - 87
BT - Advances in Nonlinear Speech Processing - 5th International Conference on Nonlinear Speech Processing, NOLISP 2011, Proceedings
T2 - 5th International Conference on Nonlinear Speech Processing, NOLISP 2011
Y2 - 7 November 2011 through 9 November 2011
ER -