DocumentCode
302564
Title
Visual representation of the speech trace using neural networks
Author
Gómez, P. ; Rodellar, V. ; Alvarez, A. ; Mayo, N. ; Rubio, F. ; Nieto, V. ; Pérez, M.M.
Author_Institution
Fac. de Inf., Univ. Politecnica de Madrid, Spain
Volume
3
fYear
1996
fDate
12-15 May 1996
Firstpage
586
Abstract
Through the present paper, a methodology to create Visual Representations of Speech for Speech Perception Enhancement Applications, is presented, based on the use of Time-Delay Neural Networks. The advantages of using Neural Networks for such purposes, come from a lower computational cost, and from an easier DSP or VLSI implementation. On the other hand, the main inconvenient found in using this technique, is the need for training to each specific Speaker. This requirement may be relaxed if proper normalization methods are used. The specific mathematical and computational issues introduced for such treatment are given, and a specific case for Computer-Aided Language Learning oriented to the Phonetic Specificities of English for Spanish Speakers is also presented and discussed. This specific technique may also be used in statistically normalizing Speech Data for Speech Recognition Systems
Keywords
data visualisation; neural nets; speech enhancement; speech recognition; DSP; English; Spanish speaker; VLSI; computer-aided language learning; phonetic specificity; speech perception enhancement; speech recognition; speech trace; statistical data normalization; time-delay neural network; training; visual representation; Computational efficiency; Decoding; Digital signal processing; Humans; Natural languages; Neural networks; Oral communication; Speech enhancement; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 1996. ISCAS '96., Connecting the World., 1996 IEEE International Symposium on
Conference_Location
Atlanta, GA
Print_ISBN
0-7803-3073-0
Type
conf
DOI
10.1109/ISCAS.1996.541664
Filename
541664
Link To Document