DocumentCode
1245772
Title
Markov model-based phoneme class partitioning for improved constrained iterative speech enhancement
Author
Hansen, John H L ; Arslan, Levent M.
Author_Institution
Dept. of Electr. Eng., Duke Univ., Durham, NC, USA
Volume
3
Issue
1
fYear
1995
fDate
1/1/1995 12:00:00 AM
Firstpage
98
Lastpage
104
Abstract
Research has shown that degrading acoustic background noise influences speech quality across phoneme classes in a nonuniform manner. This results in variable quality performance of many speech enhancement algorithms in noisy environments. A phoneme classification procedure is proposed which directs single-channel constrained speech enhancement. The procedure performs broad phoneme class partitioning of noisy speech frames using a continuous mixture hidden Markov model recognizer in conjunction with a perceptually motivated cost-based decision process. Once noisy speech frames are identified, iterative speech enhancement based on all-pole parameter estimation with inter- and intra-frame spectral constraints is employed. The phoneme class-directed enhancement algorithm is evaluated using TIMIT speech data and shown to result in substantial improvement in objective speech quality over a range of signal-to-noise ratios and individual phoneme classes
Keywords
acoustic noise; hidden Markov models; iterative methods; parameter estimation; spectral analysis; speech enhancement; Markov model-based phoneme class partitioning; TIMIT speech data; all-pole parameter estimation; continuous mixture hidden Markov model recognizer; degrading acoustic background noise; improved constrained iterative speech enhancement; noisy environments; objective speech quality; perceptually motivated cost-based decision process; signal-to-noise ratios; single-channel constrained speech enhancement; spectral constraints; speech enhancement algorithms; speech quality; variable quality performance; Acoustic noise; Background noise; Degradation; Hidden Markov models; Iterative algorithms; Partitioning algorithms; Speech analysis; Speech enhancement; Speech processing; Working environment noise;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.365376
Filename
365376
Link To Document