• DocumentCode
    1252134
  • Title

    An RNN-based preclassification method for fast continuous Mandarin speech recognition

  • Author

    Sin-Horng Chen ; Yuan-Fu Liao

  • Author_Institution
    Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu
  • Volume
    6
  • Issue
    1
  • fYear
    1998
  • fDate
    1/1/1998 12:00:00 AM
  • Firstpage
    86
  • Lastpage
    90
  • Abstract
    A novel recurrent neural network-based (RNN-based) front-end preclassification scheme for fast continuous Mandarin speech recognition is proposed. First, an RNN is employed to discriminate each input frame for the three broad classes of initial, final, and silence. A finite state machine (FSM) is then used to classify the input frame into four states including three stable states of initial (I), final (F), and silence (S), and a transient (T) state. The decision is made based on examining whether the RNN discriminates well between classes. We then restrict the search space for the three stable states in the following DP search to speed up the recognition process. The efficiency of the proposed scheme was examined by simulations in which we incorporate it with a hidden Markov model-based (HMM-based) continuous 411 Mandarin based-syllables recognizer. The experimental results showed that it can be used in conjunction with the beam search to greatly reduce the computational complexity of the HMM recognizer while keeping the recognition rate almost undegraded
  • Keywords
    computational complexity; dynamic programming; finite state machines; hidden Markov models; natural languages; pattern classification; recurrent neural nets; satellite computers; search problems; speech processing; speech recognition; DP search; HMM; RNN front-end preclassification scheme; RNN-based preclassification method; beam search; computational complexity reduction; efficiency; experimental results; fast continuous Mandarin speech recognition; final; finite state machine; hidden Markov model; initial; input frame; recognition rate; recurrent neural network; search space; silence; simulations; stable states; syllables recognizer; transient state; Automata; Computational complexity; Computational modeling; Hidden Markov models; Law; Optimal matching; Recurrent neural networks; Speech processing; Speech recognition; Testing;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.650315
  • Filename
    650315