• DocumentCode
    2880950
  • Title

    Recurrent neural network with backpropagation through time for speech recognition

  • Author

    Ahmad, Abdul Manan ; Ismail, Saliza ; Samaon, Den Fairol

  • Author_Institution
    Univ. Teknologi Malaysia, Malaysia
  • Volume
    1
  • fYear
    2004
  • fDate
    26-29 Oct. 2004
  • Firstpage
    98
  • Abstract
    Speech recognition and understanding have been studied for many years. The neural network is well-known as a technique that is able to classify nonlinear problems. Much research has been done in applying neural networks to solving the problem of recognizing speech such as Arabic. Arabic offers a number of challenges to speech recognition. We propose a fully-connected hidden layer between the input and state nodes and the output. We also investigate and show that this hidden layer makes the learning of complex classification tasks more efficient. We also investigate the difference between LPCC (linear predictive cepstrum coefficients) and MFCC (Mel-frequency cepstral coefficients) in the feature extraction process. The aim of the study was to observe the differences in the 29 letters of the Arabic alphabet from "alif" to "ya". The purpose of this research is to upgrade the knowledge and understanding of Arabic alphabet or words using a fully-connected recurrent neural network (FCRNN) and backpropagation through time (BPTT) learning algorithm. Six speakers (a mixture of male and female) in a quiet environment are used in training.
  • Keywords
    backpropagation; feature extraction; pattern classification; recurrent neural nets; speech recognition; Arabic alphabet; Arabic speech recognition; LPCC; MFCC; Mel-frequency cepstral coefficients; backpropagation through time learning algorithm; complex classification task learning; feature extraction; fully-connected hidden layer; fully-connected recurrent neural network; linear predictive cepstrum coefficients; Backpropagation; Computer science; Natural languages; Neural networks; Neurofeedback; Neurons; Recurrent neural networks; Software engineering; Speech recognition; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technology, 2004. ISCIT 2004. IEEE International Symposium on
  • Print_ISBN
    0-7803-8593-4
  • Type

    conf

  • DOI
    10.1109/ISCIT.2004.1412458
  • Filename
    1412458