DocumentCode :
1463982
Title :
Source–Filter-Based Single-Channel Speech Separation Using Pitch Information
Author :
Stark, Michael ; Wohlmayr, Michael ; Pernkopf, Franz
Author_Institution :
Signal Process. & Speech Commun. Lab., Graz Univ. of Technol., Graz, Austria
Volume :
19
Issue :
2
fYear :
2011
Firstpage :
242
Lastpage :
255
Abstract :
In this paper, we investigate the source-filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the final combination of the source and filter model results in an utterance dependent model that finally enables speaker independent source separation. The contributions of the paper are the multi-pitch tracker, the gain estimation for the VQ based method which accounts for different mixing levels, and a fast approximation for the likelihood computation. Additionally, a linear relationship between pitch tracking performance and speech separation performance is shown.
Keywords :
filtering theory; hidden Markov models; speech processing; vector quantisation; factorial hidden Markov model; gain estimation; likelihood computation; model-driven method; multipitch estimation; multipitch tracker; nonnegative matrix factorization; pitch information; single-channel speech separation; source-filter-based approach; speaker independent source separation; vector quantization; vocal tract filter modeling; Frequency; Hidden Markov models; Humans; Image analysis; Information filtering; Information filters; Matrix decomposition; Source separation; Speech; Vector quantization; Single-channel speech separation (SCSS); multi-pitch estimation; source–filter representation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2047419
Filename :
5443705
Link To Document :
بازگشت