• DocumentCode
    2311503
  • Title

    Preprocessing and neural classification of English stop consonants [b, d, g, p, t, k]

  • Author

    Esposito, A. ; Ezin, C.E. ; Ceccarelli, M.

  • Author_Institution
    Int. Inst. for Adv. Sci. Studies, Vietri Sul Mare, Italy
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1249
  • Abstract
    Neural networks are accepted as powerful learning tools in pattern recognition in which they proved their performance. Nevertheless, many problems like phoneme classification with a multi-speaker continuous speech database are hard even for neural networks. The authors´ aim is to propose an artificial neural network architecture that detects acoustic features in speech signals and classifies them correctly. They reached this goal with English stop consonants [b, d, g, p, t, k] extracted from the general multi-speaker database (TIMlT) by modifying some parameter values in the preprocessing algorithm and by using a modified TDNN (time delay neural network) architecture. The net performed a good classification giving as testing recognition percentage the following results: 92.9 for [b], 91.8 for [d], 92.4 for [g], 80.3 for [p], 90.2 for [t], 91.2 for [k]
  • Keywords
    feature extraction; neural net architecture; pattern classification; speech recognition; English stop consonants; acoustic feature detection; artificial neural network architecture; learning tools; modified time delay neural network architecture; multi-speaker continuous speech database; neural classification; neural networks; pattern recognition; phoneme classification; preprocessing algorithm; speech signals; Acoustic signal detection; Artificial neural networks; Computer vision; Delay effects; Neural networks; Pattern recognition; Performance evaluation; Spatial databases; Speech; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607835
  • Filename
    607835