• DocumentCode
    302564
  • Title

    Visual representation of the speech trace using neural networks

  • Author

    Gómez, P. ; Rodellar, V. ; Alvarez, A. ; Mayo, N. ; Rubio, F. ; Nieto, V. ; Pérez, M.M.

  • Author_Institution
    Fac. de Inf., Univ. Politecnica de Madrid, Spain
  • Volume
    3
  • fYear
    1996
  • fDate
    12-15 May 1996
  • Firstpage
    586
  • Abstract
    Through the present paper, a methodology to create Visual Representations of Speech for Speech Perception Enhancement Applications, is presented, based on the use of Time-Delay Neural Networks. The advantages of using Neural Networks for such purposes, come from a lower computational cost, and from an easier DSP or VLSI implementation. On the other hand, the main inconvenient found in using this technique, is the need for training to each specific Speaker. This requirement may be relaxed if proper normalization methods are used. The specific mathematical and computational issues introduced for such treatment are given, and a specific case for Computer-Aided Language Learning oriented to the Phonetic Specificities of English for Spanish Speakers is also presented and discussed. This specific technique may also be used in statistically normalizing Speech Data for Speech Recognition Systems
  • Keywords
    data visualisation; neural nets; speech enhancement; speech recognition; DSP; English; Spanish speaker; VLSI; computer-aided language learning; phonetic specificity; speech perception enhancement; speech recognition; speech trace; statistical data normalization; time-delay neural network; training; visual representation; Computational efficiency; Decoding; Digital signal processing; Humans; Natural languages; Neural networks; Oral communication; Speech enhancement; Speech processing; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 1996. ISCAS '96., Connecting the World., 1996 IEEE International Symposium on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    0-7803-3073-0
  • Type

    conf

  • DOI
    10.1109/ISCAS.1996.541664
  • Filename
    541664