DocumentCode :
542230
Title :
Noise-robust speech recognition using a new spectral estimation method “PHASOR”
Author :
Aikawa, Kiyoaki ; Ishizuka, Kentaro
Author_Institution :
NTT Communication Science Laboratories, NTT Corporation, 3-1 Morinosato-Wakamiya, Atsugi-Shi, Kanagawa 243-0198 Japan
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This paper proposes a new noise-robust spectral estimation method for speech recognition. The new method, called PHASOR, is characterized by inside-frame processing. The speech spectrum is estimated from a single impulse response obtained by summing multiple pitch periods in a frame with synchronizing the phase. PHASOR improves the spectral estimation accuracy and suppresses the additive noise because of the inside-frame processing. These improvement is more effective when the pitch fluctuates or changes in the frame. Speaker-dependent and speaker-independent phoneme recognition experiments demonstrate that the PHASOR greatly reduces the recognition error rate for speech data contaminated by noise. It also outperforms conventional noise reduction methods, cepstral mean normalization and spectral subtraction.
Keywords :
Cepstral analysis; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743738
Filename :
5743738
Link To Document :
بازگشت