• DocumentCode
    316807
  • Title

    Improving environmental robustness of speech recognition using neural networks

  • Author

    Sirigos, John ; Fakotakis, Nikos ; Kokkinakis, George

  • Author_Institution
    Wire Commun. Lab., Patras Univ., Greece
  • Volume
    2
  • fYear
    1997
  • fDate
    2-4 Jul 1997
  • Firstpage
    575
  • Abstract
    This paper presents a method for improving speech recognition in noisy environments using neural networks. Two multilayer perceptrons (MLPs) are used. The first MLP minimises the difference between noisy and clean speech; the second measures the degree of noise in the speech signal and adjusts the time interval between successive frames of the processed speech signal accordingly. Used as a pre-processing stage of a speech recognition system, the technique extends the system to different environments without re-training the recognizer. Only the pre-processing stage needs to be trained, using a small portion of noisy data created by passing part of the original clean speech database (the one used to train the speech recognizer) through the desired environment. There is no need to create a new database in the desired working environment. The method was trained with two well-known databases, TIMIT and NTIMIT, and tested on a vowel spotting system. Evaluation through a vowel spotting process shows a significant improvement in the recognition rate of the system.
  • Keywords
    learning (artificial intelligence); multilayer perceptrons; noise; speech processing; speech recognition; NTIMIT; TIMIT; automatic speech recognition; clean speech; environmental robustness; multilayer perceptrons; neural networks; noisy data; noisy speech; preprocessing stage; processed speech signal; recognition rate; speech database; speech recognition system; speech recognizer training; speech signal; time interval; vowel spotting system; Databases; Multilayer perceptrons; Neural networks; Noise measurement; Noise robustness; Signal processing; Speech enhancement; Speech processing; Speech recognition; Working environment noise
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Digital Signal Processing Proceedings, 1997. DSP 97., 1997 13th International Conference on
  • Conference_Location
    Santorini
  • Print_ISBN
    0-7803-4137-6
  • Type

    conf

  • DOI
    10.1109/ICDSP.1997.628414
  • Filename
    628414