DocumentCode :
45718
Title :
Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain
Author :
Dongwen Ying ; Yonghong Yan
Author_Institution :
Key Lab. of Speech Acoust. & Content Understanding, Inst. of Acoust., Beijing, China
Volume :
21
Issue :
6
fYear :
2013
fDate :
Jun-13
Firstpage :
1145
Lastpage :
1157
Abstract :
The temporal correlation of speech presence/absence is widely used in noise estimation. The most popular technique for exploiting temporal correlation is the smoothing of noisy spectra using a time-recursive filter, in which the forgetting factor is controlled by speech presence probability. However, this technique is not unified into a theoretical framework that enables optimal noise estimation. In theory, hidden Markov models (HMMs) are superior to this technique in modeling temporal correlation. HMMs can model a time sequence of presence/absence of speech signal as a dynamic process of the transition between speech and non-speech states. Moreover, a number of methods, such as maximum likelihood, are available for optimal estimation of HMM parameters. This paper presents a constrained sequential HMM for modeling the log-power sequence on each frequency band. The emission probability of each HMM state is represented by a Gaussian model. The Gaussian mean of the non-speech state is considered as the optimal estimate of noise logarithmic power. The HMM parameter set is sequentially estimated from one frame to another on the basis of maximum likelihood. The proposed method is compared with well-established algorithms through various experiments. Our method delivers more accurate results and does not rely on the assumption of the “non-speech signal onset” as do most algorithms.
Keywords :
Gaussian processes; hidden Markov models; maximum likelihood sequence estimation; recursive filters; smoothing methods; speech processing; Gaussian mean; Gaussian model; HMM parameter optimal estimation; HMM state emission probability; constrained sequential hidden Markov model; dynamic process; forgetting factor; log-power sequence modeling; log-spectral domain; maximum likelihood; noise logarithmic power optimal estimation; noisy spectra smoothing; nonspeech state; speech presence probability; speech signal absence; speech signal presence; speech state; temporal correlation; time-recursive filter; Correlation; Estimation; Hidden Markov models; Noise; Noise measurement; Speech; Speech processing; Constraints; noise estimation; sequential hidden Markov model; speech presence probability; temporal correlation;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2013.2245648
Filename :
6451162
Link To Document :
بازگشت