• DocumentCode
    2349095
  • Title

    Vowel, Digit and Continuous Speech Recognition Based on Statistical, Neural and Hybrid Modelling by Using ASRS_RL

  • Author

    Dumitru, Corneliu Octavian ; Gavat, Inge

  • Author_Institution
    Univ. Politehnica Bucharest, Bucharest
  • fYear
    2007
  • fDate
    9-12 Sept. 2007
  • Firstpage
    856
  • Lastpage
    863
  • Abstract
    In the first part of this paper a recognizer based on hidden Markov models (HMMs) is compared in the simple task of vowel recognition with a recognizer based on the multilayer perceptron (MLP). In this situation, we have obtained better results for the last recognizer, fact which highlights the advantage of the discriminative training of the perceptron versus the maximum likelihood training of the HMM. Because MLPs have problems with accommodating time sequences like speech, a combination of a HMM with a MLP could be a good idea. In the second part of the paper, the hybrid structure HMMMLP is compared with the simple HMM in a digit recognition task. The hybrid structure has recognition rates improved with around 2%. In the last part of the paper are describes the continuous speech recognition experiments for Romanian language, by using HMM modelling. The progresses concern enhancement of modelling by taking into account the context in form of triphones, improvement of speaker independence by applying a gender specific training and enlargement of the feature categories used to describe speech sequences. In order to easier handling the recognition experiments an Automatic Speech Recognition System for Romanian Language (ASRS_RL) was designed.
  • Keywords
    hidden Markov models; maximum likelihood estimation; multilayer perceptrons; natural language processing; speech recognition; Romanian language; automatic speech recognition system; continuous speech recognition; hidden Markov models; hybrid modelling; maximum likelihood training; multilayer perceptron; neural modelling; speech sequences; statistical modelling; vowel recognition; Speech recognition; HMM; Hybrid; LPC; MFCC; MLP; PLP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    EUROCON, 2007. The International Conference on "Computer as a Tool"
  • Conference_Location
    Warsaw
  • Print_ISBN
    978-1-4244-0813-9
  • Electronic_ISBN
    978-1-4244-0813-9
  • Type

    conf

  • DOI
    10.1109/EURCON.2007.4400336
  • Filename
    4400336