Title :
Structured mean field method for single-microphone speech separation with factorial Hidden Markov Model
Author :
Yu Ting Yeung ; Tan Lee
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Abstract :
A variational statistical inference method referred as structured mean field method is studied for factorial Hidden Markov Model (HMM) formulation of single-microphone speech separation problem. By decoupling the Markov chains of the individual speech sources coupled during the mixing process of the speech mixture, the complexity of temporal inference of the speech sources is reduced to quadratic with the number of acoustic states of the sources. Speech separation and automatic speech recognition experiments are performed on the reconstructed speech. Experimental results show that the studied approximating inference method achieves the similar separation results as the exact inference algorithm in terms of Perceptual Evaluation of Speech Quality (PESQ) and word error rate (WER).
Keywords :
hidden Markov models; inference mechanisms; signal reconstruction; source separation; speech recognition; WER; automatic speech recognition; factorial hidden Markov model; individual speech sources; perceptual evaluation of speech quality; single-microphone speech separation; speech mixture mixing process; speech reconstruction; structured mean field method; temporal inference complexity; word error rate; Acoustics; Approximation algorithms; Hidden Markov models; Inference algorithms; Speech; Speech processing; Speech recognition; factorial HMM; speech separation; structured mean field approximation; variational method;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2013 IEEE China Summit & International Conference on
Conference_Location :
Beijing
DOI :
10.1109/ChinaSIP.2013.6625311