• DocumentCode
    395187
  • Title

    Distinctive phonetic feature extraction for robust speech recognition

  • Author

    Fukuda, Takashi ; Yamamoto, Wataru ; Nitta, Tsuneo

  • Author_Institution
    Graduate Sch. of Eng., Toyohashi Univ. of Technol., Japan
  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    The paper describes an attempt to extract distinctive phonetic features (DPFs) that represent articulatory gestures in linguistic theory by using a multilayer neural network (MLN) and to apply the DPFs to noise-robust speech recognition. In the DPF extraction stage, after converting a speech signal to acoustic features composed of local features (LFs), an MLN with 33 output units, corresponding to context-dependent DPFs of 11 DPFs, 11 preceding context DPFs, and 11 following context DPFs, maps the LFs to DPFs. The proposed DPF parameters without MFCC (Mel-frequency cepstral coefficients) were firstly evaluated in comparison with a standard parameter set of MFCC and dynamic features on a word recognition task using clean speech; the result showed the same performance as that of the standard set. Noise robustness of these parameters was then tested with four types of additive noise and the proposed DPF parameters outperformed the standard set except for one additive noise type.
  • Keywords
    acoustic noise; feature extraction; neural nets; speech recognition; white noise; MFCC; Mel-frequency cepstral coefficients; acoustic noise; additive noise; articulatory gestures; distinctive phonetic feature extraction; linguistic theory; multilayer neural network; robust speech recognition; white noise; Additive noise; Cepstral analysis; Feature extraction; Mel frequency cepstral coefficient; Multi-layer neural network; Neural networks; Noise robustness; Speech analysis; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1202285
  • Filename
    1202285