DocumentCode :
1650457
Title :
Using dynamic conditional random field on single-microphone speech separation
Author :
Yu Ting Yeung ; Tan Lee ; Cheung-Chi Leung
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
fYear :
2013
Firstpage :
146
Lastpage :
150
Abstract :
The use of dynamic conditional random field (DCRF) for model-based single-microphone speech separation is investigated. The speech sources are represented by acoustic state sequences from speaker-dependent acoustic models. The posterior probabilities of the source acoustic states given a speech mixture are inferred with a maximum entropy probability distribution which is represented by DCRF. The posterior probabilities are needed for minimum mean-square error estimation of the speech sources. Loopy belief propagation is applied for the inference. Averaged stochastic gradient descent and limited-memory BFGS are compared for parameter estimation. With the log-magnitude spectrum of the speech mixture as input observation, the proposed method achieves better separation performance in terms of Blind Source Separation Metrics (SDR, SAR, SIR) and PESQ than a factorial hidden Markov model baseline system in our experiments.
Keywords :
mean square error methods; speech processing; statistical distributions; acoustic state sequences; averaged stochastic gradient descent; blind source separation metrics; dynamic conditional random field; factorial hidden Markov model baseline system; log-magnitude spectrum; loopy belief propagation; maximum entropy probability distribution; minimum mean-square error estimation; model-based single-microphone speech separation; parameter estimation; posterior probabilities; source acoustic states; speaker-dependent acoustic models; speech mixture; speech sources; Acoustics; Computational modeling; Estimation; Hidden Markov models; Speech; Speech processing; Training; dynamic conditional random field; single-microphone speech separation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6637626
Filename :
6637626
Link To Document :
بازگشت