Title :
Speaker-independent voiced-stop-consonant recognition using a block-windowed neural network architecture
Author :
Bryant, Benjamin D. ; Gowdy, John N.
Author_Institution :
Dept. of Electr. & Comput. Eng., Clemson Univ., SC, USA
Abstract :
The authors study several of the more well-known connectionist models, and how they address the time and frequency variability of the multispeaker, voiced-stop-consonant recognition task. Among the network architectures reviewed or tested for were the self-organizing feature maps (SOFM) architecture, various derivatives of this architecture, the time-delay neural network (TDNN) architecture, various derivatives of this architecture, and two frequency-and-time-shift-invariant architectures, frequency-shift-invariant TDNN, and the block-windowed neural network (FTDNN and BWNN). Voiced-stop speech was extracted from up to four dialect regions of the TIMIT continuous speech corpus for subsequent preprocessing and training and testing of network instances. Various feature representations were tested for their robustness in representing the voiced-stop consonants
Keywords :
delay circuits; learning (artificial intelligence); neural net architecture; self-organising feature maps; speech recognition; TIMIT continuous speech corpus; block-windowed neural network architecture; connectionist models; feature representations; frequency-shift-invariant TDNN; preprocessing; self-organizing feature maps; speaker independent recognition; time-delay neural network; training; voiced-stop-consonant recognition; Biological neural networks; Biological system modeling; Biomembranes; Data mining; Frequency; Nervous system; Neural networks; Robustness; Speech recognition; Testing;
Conference_Titel :
System Theory, 1993. Proceedings SSST '93., Twenty-Fifth Southeastern Symposium on
Conference_Location :
Tuscaloosa, AL
Print_ISBN :
0-8186-3560-6
DOI :
10.1109/SSST.1993.522811