TY - GEN
T1 - A comparative study on sparsity penalties for NMF-based speech separation
T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
AU - Joder, Cyril
AU - Weninger, Felix
AU - Virette, David
AU - Schuller, Bjorn
PY - 2013/10/18
Y1 - 2013/10/18
N2 - In this work, we study the usefulness of several types of sparsity penalties in the task of speech separation using supervised and semi-supervised Nonnegative Matrix Factorization (NMF). We compare different criteria from the literature to two novel penalty functions based on Wiener Entropy, in a large-scale evaluation on spontaneous speech overlaid by realistic domestic noise, as well as music and stationary environmental noise corpora. The results show that enforcing the sparsity constraint in the separation phase does not improve the perceptual quality. In the learning phase however, it yields a better estimation of the base spectra, especially in the case of supervised NMF, where the proposed criteria delivered the best results.
AB - In this work, we study the usefulness of several types of sparsity penalties in the task of speech separation using supervised and semi-supervised Nonnegative Matrix Factorization (NMF). We compare different criteria from the literature to two novel penalty functions based on Wiener Entropy, in a large-scale evaluation on spontaneous speech overlaid by realistic domestic noise, as well as music and stationary environmental noise corpora. The results show that enforcing the sparsity constraint in the separation phase does not improve the perceptual quality. In the learning phase however, it yields a better estimation of the base spectra, especially in the case of supervised NMF, where the proposed criteria delivered the best results.
KW - Source separation
KW - noise cancellation
KW - single-channel speech enhancement
UR - http://www.scopus.com/inward/record.url?scp=84890482821&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2013.6637770
DO - 10.1109/ICASSP.2013.6637770
M3 - Conference contribution
AN - SCOPUS:84890482821
SN - 9781479903566
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 858
EP - 862
BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
Y2 - 26 May 2013 through 31 May 2013
ER -