• DocumentCode
    3109718
  • Title

    Development of Assamese Phonetic Engine: Some issues

  • Author

    Dev Sarma, Biswajit ; Sarma, M. ; Sarma, M. ; Prasanna, S.R.M.

  • Author_Institution
    Dept. of Electron. & Electr. Eng., Indian Inst. of Technol., Guwahati, Guwahati, India
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The phonetic engine is a system that performs speech signal to symbol transformation. This work describes some issues in the development of an Assamese Phonetic Engine (PE). International phonetic alphabet (IPA) is used as the phonetic unit to transcribe the speech database collected in three different modes, namely, reading, lecture and conversation modes. Only reading mode data is used for training and Hidden markov model (HMM) is used to model each phonetic unit without imposing any language or contextual constraint. The trained HMMs are used to derive a sequence of phonetic units from a test speech signal. Accuracy of 47.31%, 45.30% and 36.13% is achieved in reading, lecture and conversation mode, respectively. Confusion among the phonetic units specific to Assamese are discussed. Issues related to different recording modes, language and native speaker dependencies are discussed. The speech data is also collected in Hindi from three different sets of speakers to study speaker, language and native dependancies. Accuracy of 40.5%, 36.10% and 29.61% is achieved in native speaker dependent, native speaker independent and non-native speaker independent cases, respectively.
  • Keywords
    hidden Markov models; natural language processing; speech synthesis; Assamese phonetic engine; HMM; Hindi; IPA; PE; hidden Markov model; international phonetic alphabet; native speaker dependent cases; native speaker independent cases; nonnative speaker independent cases; phonetic units; speech signal-to-symbol transformation; test speech signal; Acoustics; Engines; Hidden Markov models; Speech; Speech recognition; Testing; Training; Phonetic Engine (PE); Phonetic Unit; confusion; language dependancy; native speaker dependancy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2013 Annual IEEE
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4799-2274-1
  • Type

    conf

  • DOI
    10.1109/INDCON.2013.6725966
  • Filename
    6725966