• DocumentCode
    3081333
  • Title

    Malayalam Speech Recognition system and its application for visually impaired people

  • Author

    Anand, Anu V. ; Devi, P. Shobana ; Stephen, Jose ; Bhadran, V.K.

  • Author_Institution
    Centre for Dev. of Adv. Comput., Trivandrum, India
  • fYear
    2012
  • fDate
    7-9 Dec. 2012
  • Firstpage
    619
  • Lastpage
    624
  • Abstract
    This paper describes the development of state-of-the-art large vocabulary continuous speech recognition (LVCSR) system for the Malayalam language with an application for visually challenged. For an LVCSR, building a high accurate acoustic models and large-scale language models are the challenging task. Speech corpus for training the system is collected from 80 native speakers in room environment ensuring the speaker variance. Mel-frequency Cepstral Coefficients (MFCC) method is used as a front-end to extract acoustic features from the input signal. Acoustic model is built on 30 hours of speech data based on Hidden Markov Model (HMM). A hybrid model, integrating rule based and statistical method is used to handle pronunciation variations in the dictionary. The best configuration of the system achieved word accuracy of 75% in average. Accuracy of the system is further increased up to 80% in average, by implementing speaker adaptation technique. The developed system is integrated to OpenOffice Writer together with TTS for making it user friendly editor for visually challenged people.
  • Keywords
    cepstral analysis; feature extraction; hidden Markov models; natural language processing; speech recognition; HMM; LVCSR; MFCC; Malayalam speech recognition system; OpenOffice Writer; acoustic feature extraction; acoustic models; hidden Markov model; language models; large vocabulary continuous speech recognition; mel frequency cepstral coefficients; speaker adaptation technique; speech corpus; visually impaired people; Acoustics; Computational modeling; Dictionaries; Hidden Markov models; Speech; Speech recognition; Training; CMU SPHINX; HMM; LVCSR; Speech Recognition; Speech Recognition application; Voice enabled OpenOffice;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2012 Annual IEEE
  • Conference_Location
    Kochi
  • Print_ISBN
    978-1-4673-2270-6
  • Type

    conf

  • DOI
    10.1109/INDCON.2012.6420692
  • Filename
    6420692