Title :
A new metric for VQ-based speech enhancement and separation
Author :
Christensen, Mads Græsbøll ; Mowlaee, Pejman
Author_Institution :
Dept. of Arch., Design & Media Technol., Aalborg Univ., Aalborg, Denmark
Abstract :
Speech enhancement and separation algorithms frequently employ two-stage processing schemes, where the signal is first mapped to an intermediate low-dimensional parametric description. Then, these parameters are mapped to vectors in codebooks trained on individual noise-free sources using a vector quantizer. To obtain accurate parameters, one must employ an estimator that takes the signal characteristics into account. An open question is, however, how to derive metrics for use in the vector quantization process. In this paper, we present and derive a new metric aimed at exactly this, and we exemplify and demonstrate its use in sinusoidal modeling. The metric takes into account that parameters may have different uncertainties and dependencies associated with them and thus leads to more accurate estimates, as is demonstrated in experiments. Moreover, we incorporate the metric in a recently proposed speech separation algorithm and compare its performance to state-of-the-art methods.
Keywords :
speech coding; speech enhancement; vector quantisation; VQ-based speech enhancement; codebooks; noise-free sources; speech separation; vector quantization process; vector quantizer; Measurement; Signal to noise ratio; Speech; Speech coding; Speech enhancement; Vector quantization; Speech processing; speech enhancement; vector quantization;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947420