A real-time speech enhancement framework for multi-party meetings

Rudy Rotili, Emanuele Principi, Stefano Squartini, Björn Schuller

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

This paper proposes a real-time speech enhancement framework that operates in the presence of multiple sources in reverberant environments. The aim is to automatically reduce the distortions that room reverberation introduces into the available distant speech signals, and thus to achieve a significant improvement in speech quality for each speaker. The framework is composed of three cooperating blocks, each fulfilling a specific task: speaker diarization, room impulse response identification, and speech dereverberation. In particular, the speaker diarization algorithm is essential for steering the operations performed in the other two stages according to the speakers' activity in the room. Extensive computer simulations have been performed using a subset of the AMI database; the results obtained show the effectiveness of the approach.
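The control flow implied by the three blocks can be illustrated with a minimal sketch. It is not the authors' implementation: the energy-threshold diarizer, the one-frame-per-speaker input layout, the rule that the channel estimate adapts only when a single speaker is active, and all names (`SpeakerState`, `energy_diarizer`, `process_frame`) are assumptions made purely for illustration. The identification and dereverberation stages are reduced to counters so that only the gating logic is shown.

```python
from dataclasses import dataclass
from typing import Dict, List, Set

Frame = List[float]  # one block of samples associated with one speaker (assumed layout)


@dataclass
class SpeakerState:
    """Per-speaker bookkeeping (hypothetical; not taken from the paper)."""
    channel_updates: int = 0   # times the channel estimate would have been refreshed
    enhanced_frames: int = 0   # frames that would have been dereverberated


def energy_diarizer(frames: Dict[str, Frame], threshold: float = 0.01) -> Set[str]:
    """Toy stand-in for speaker diarization: a speaker counts as active
    when the mean energy of their frame exceeds a fixed threshold."""
    active = set()
    for speaker, frame in frames.items():
        energy = sum(x * x for x in frame) / max(len(frame), 1)
        if energy > threshold:
            active.add(speaker)
    return active


def process_frame(frames: Dict[str, Frame],
                  states: Dict[str, SpeakerState]) -> Set[str]:
    """Run one frame through the three blocks, gated by speaker activity."""
    active = energy_diarizer(frames)  # block 1: who is speaking?
    for speaker in active:
        state = states.setdefault(speaker, SpeakerState())
        # Assumed policy: adapt the channel estimate only when exactly
        # one speaker is active, to avoid adapting on overlapped speech.
        if len(active) == 1:
            state.channel_updates += 1  # block 2 placeholder: impulse response identification
        state.enhanced_frames += 1      # block 3 placeholder: dereverberation
    return active
```

In a real system the two placeholder counters would be replaced by the identification and dereverberation algorithms; the sketch only shows how the diarization output could decide which of them runs for which speaker on a given frame.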

Original language: English
Title of host publication: Advances in Nonlinear Speech Processing - 5th International Conference on Nonlinear Speech Processing, NOLISP 2011, Proceedings
Pages: 80-87
Number of pages: 8
State: Published - 2011
Event: 5th International Conference on Nonlinear Speech Processing, NOLISP 2011 - Las Palmas de Gran Canaria, Spain
Duration: 7 Nov 2011 to 9 Nov 2011

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7015 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Keywords

  • Blind Channel Identification
  • Real-time Signal Processing
  • Speaker Diarization
  • Speech Dereverberation
  • Speech Enhancement
