Magnitude spectrum enhancement for robust speech recognition

Author

Tu, Wen-Hsiang ; Hung, Jeih-weih

Author_Institution

Dept of Electr. Eng., Nat. Chi Nan Univ., Taiwan

fYear

2010

fDate

14-19 March 2010

Firstpage

4586

Lastpage

4589

Abstract

In this paper, an effective compensation scheme for the spectra of speech signals is proposed in order to improve their noise robustness. In this compensation scheme, named magnitude spectrum enhancement (MSE), a voice activity detection (VAD) process is first processed for the frame sequence of the utterance, and then the magnitude spectra of non-speech frames are set to be small while those of speech frames are amplified. In experiments conducted on the Aurora-2 noisy digits database, MSE achieves a relative error reduction rate of nearly 50% from the baseline processing, which outperforms the well-known spectral-domain speech enhancement techniques, spectral subtraction (SS) and Wiener filtering (WF). In addition, the proposed MSE can be integrated with cepstral-domain robustness methods, like mean and variance normalization (MVN) and histogram normalization (HEQ), to achieve further improved recognition accuracy under noise-corrupted environments.

Keywords

Wiener filters; spectral analysis; speech enhancement; speech recognition; Aurora-2 noisy digit database; Wiener filtering; cepstral-domain robustness method; frame sequence; histogram normalization; magnitude spectrum enhancement; mean and variance normalization; noise robustness; robust speech recognition; spectral subtraction; spectral-domain speech enhancement; speech signal; voice activity detection; Cepstral analysis; Electronic mail; Histograms; Mel frequency cepstral coefficient; Noise robustness; Speech enhancement; Speech processing; Speech recognition; Wiener filter; Working environment noise; robust speech features; speech enhancement; speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location

Dallas, TX

ISSN

1520-6149

Print_ISBN

978-1-4244-4295-9

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2010.5495556

Filename

5495556