DocumentCode :
395187
Title :
Distinctive phonetic feature extraction for robust speech recognition
Author :
Fukuda, Takashi ; Yamamoto, Wataru ; Nitta, Tsuneo
Author_Institution :
Graduate Sch. of Eng., Toyohashi Univ. of Technol., Japan
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
The paper describes an attempt to extract distinctive phonetic features (DPFs) that represent articulatory gestures in linguistic theory by using a multilayer neural network (MLN) and to apply the DPFs to noise-robust speech recognition. In the DPF extraction stage, after converting a speech signal to acoustic features composed of local features (LFs), an MLN with 33 output units, corresponding to context-dependent DPFs of 11 DPFs, 11 preceding context DPFs, and 11 following context DPFs, maps the LFs to DPFs. The proposed DPF parameters without MFCC (Mel-frequency cepstral coefficients) were firstly evaluated in comparison with a standard parameter set of MFCC and dynamic features on a word recognition task using clean speech; the result showed the same performance as that of the standard set. Noise robustness of these parameters was then tested with four types of additive noise and the proposed DPF parameters outperformed the standard set except for one additive noise type.
Keywords :
acoustic noise; feature extraction; neural nets; speech recognition; white noise; MFCC; Mel-frequency cepstral coefficients; acoustic noise; additive noise; articulatory gestures; distinctive phonetic feature extraction; linguistic theory; multilayer neural network; robust speech recognition; white noise; Additive noise; Cepstral analysis; Feature extraction; Mel frequency cepstral coefficient; Multi-layer neural network; Neural networks; Noise robustness; Speech analysis; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202285
Filename :
1202285
Link To Document :
بازگشت