Title :
Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement
Author :
Hansen, John H L ; Arslan, Levent M.
Author_Institution :
Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
fDate :
1/1/1995 12:00:00 AM
Abstract :
Research has shown that degrading acoustic background noise influences speech quality across phoneme classes in a nonuniform manner. This results in variable quality performance of many speech enhancement algorithms in noisy environments. A phoneme classification procedure is proposed which directs single-channel constrained speech enhancement. The procedure performs broad phoneme class partitioning of noisy speech frames using a continuous mixture hidden Markov model recognizer in conjunction with a perceptually motivated cost-based decision process. Once noisy speech frames are identified, iterative speech enhancement based on all-pole parameter estimation with inter- and intra-frame spectral constraints is employed. The phoneme class-directed enhancement algorithm is evaluated using TIMIT speech data and shown to result in substantial improvement in objective speech quality over a range of signal-to-noise ratios and individual phoneme classes
Keywords :
acoustic noise; hidden Markov models; iterative methods; parameter estimation; spectral analysis; speech enhancement; Markov model-based phoneme class partitioning; TIMIT speech data; all-pole parameter estimation; continuous mixture hidden Markov model recognizer; degrading acoustic background noise; improved constrained iterative speech enhancement; noisy environments; objective speech quality; perceptually motivated cost-based decision process; signal-to-noise ratios; single-channel constrained speech enhancement; spectral constraints; speech enhancement algorithms; speech quality; variable quality performance; Acoustic noise; Background noise; Degradation; Hidden Markov models; Iterative algorithms; Partitioning algorithms; Speech analysis; Speech enhancement; Speech processing; Working environment noise;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on