• DocumentCode
    3113009
  • Title

    Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia

  • Author

    Manjunath, K.E. ; Kumar, S. B. Sunil ; Pati, Debadatta ; Satapathy, Biswajit ; Rao, K. Sreenivasa

  • Author_Institution
    Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) for determining a sequence of Consonant-Vowel (CV) units present in a given speech utterance. In this work, we are focusing on developing CVRSs for Indian languages namely Bengali and Odia. This framework of developing CVRSs can be extended to any Indian languages. We have developed two separate CVRSs for Bengali and Odia languages. The CVRS is developed using read speech corpus. In this study, 67 CV classes for Bengali and 58 CV classes for Odia are used. Mel Frequency Cepstral Coefficients (MFCCs) are used as features for building models. The Vowel Onset Points (VOPs) are used as anchor points for marking syllable boundaries and for feature extraction. Support Vector Machines (SVMs) are used for building CV models. The performance of the developed CVRSs are evaluated in speaker dependent and speaker independent modes. In speaker dependent case, the best percentage accuracies of Bengali and Odia CVRSs are 49.48 and 69.66 respectively whereas in speaker independent case, the best percentage accuracies of Bengali and Odia CVRSs are 40.26 and 41.59 respectively.
  • Keywords
    cepstral analysis; feature extraction; natural language processing; speech recognition; support vector machines; Bengali language; CV classes; CV models; CV unit sequence; CVRS development; CVRS performance evaluation; Indian languages; MFCC; Mel frequency cepstral coefficients; Odia language; SVMs; VOP; anchor points; consonant-vowel recognition system development; consonant-vowel unit sequence; feature extraction; read speech corpus; speaker dependent modes; speaker independent modes; speech utterance; support vector machines; syllable boundary marking; vowel onset points; Accuracy; Feature extraction; Hidden Markov models; Speech; Support vector machines; Training; Vectors; Consonant-Vowel recognition; International Phonetic Alphabet; Support Vector Machine; Syllable recognition; Vowel Onset Point;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    India Conference (INDICON), 2013 Annual IEEE
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4799-2274-1
  • Type

    conf

  • DOI
    10.1109/INDCON.2013.6726109
  • Filename
    6726109