Visual representation of the speech trace using neural networks

Author

Gómez, P. ; Rodellar, V. ; Alvarez, A. ; Mayo, N. ; Rubio, F. ; Nieto, V. ; Pérez, M.M.

Author_Institution

Fac. de Inf., Univ. Politecnica de Madrid, Spain

Volume

3

fYear

1996

fDate

12-15 May 1996

Firstpage

586

Abstract

Through the present paper, a methodology to create Visual Representations of Speech for Speech Perception Enhancement Applications, is presented, based on the use of Time-Delay Neural Networks. The advantages of using Neural Networks for such purposes, come from a lower computational cost, and from an easier DSP or VLSI implementation. On the other hand, the main inconvenient found in using this technique, is the need for training to each specific Speaker. This requirement may be relaxed if proper normalization methods are used. The specific mathematical and computational issues introduced for such treatment are given, and a specific case for Computer-Aided Language Learning oriented to the Phonetic Specificities of English for Spanish Speakers is also presented and discussed. This specific technique may also be used in statistically normalizing Speech Data for Speech Recognition Systems

Keywords

data visualisation; neural nets; speech enhancement; speech recognition; DSP; English; Spanish speaker; VLSI; computer-aided language learning; phonetic specificity; speech perception enhancement; speech recognition; speech trace; statistical data normalization; time-delay neural network; training; visual representation; Computational efficiency; Decoding; Digital signal processing; Humans; Natural languages; Neural networks; Oral communication; Speech enhancement; Speech processing; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuits and Systems, 1996. ISCAS '96., Connecting the World., 1996 IEEE International Symposium on

Conference_Location

Atlanta, GA

Print_ISBN

0-7803-3073-0

Type

conf

DOI

10.1109/ISCAS.1996.541664

Filename

541664