DocumentCode :
1749655
Title :
Maximum-likelihood compensation of zero-memory nonlinearities in speech signals
Author :
Morris, Robert W. ; Clements, Mark A.
Author_Institution :
Center for Signal & Image Processing, Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
289
Abstract :
An algorithm to blindly compensate zero-memory nonlinear distortions of speech waveforms is derived and analyzed. This method finds a maximum-likelihood estimate of the distortion without a priori knowledge of the microphone characteristics by using the expectation-maximization algorithm. The autoregressive signal model coefficients are solved jointly with the nonlinearity estimate created by an extended Kalman filter. Also, a new family of nonlinear functions is developed for use with this algorithm, although the method can estimate the shape of any parametric zero-memory nonlinearity. These nonlinear distortions can degrade the speech recognition rates, yet lower the perceptual quality only slightly. The compensation algorithm improves automatic speech recognition of distorted speech for a variety of such nonlinearities
Keywords :
Kalman filters; autoregressive processes; filtering theory; maximum likelihood estimation; nonlinear distortion; nonlinear filters; nonlinear functions; optimisation; speech enhancement; speech intelligibility; speech recognition; automatic speech recognition; autoregressive signal model coefficients; compensation algorithm; distorted speech; expectation-maximization algorithm; extended Kalman filter; maximum-likelihood compensation; maximum-likelihood estimate; microphone characteristics; nonlinear functions; nonlinearity estimate; speech enhancement; speech perceptual quality; speech quality; speech recognition rates; speech signals; speech waveforms; zero-memory nonlinear distortions; Algorithm design and analysis; Automatic speech recognition; Degradation; Expectation-maximization algorithms; Maximum likelihood estimation; Microphones; Nonlinear distortion; Shape; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940824
Filename :
940824
Link To Document :
بازگشت