Development of Consonant-Vowel Recognition Systems for Indian languages: Bengali and Odia

Author

Manjunath, K.E. ; Kumar, S. B. Sunil ; Pati, Debadatta ; Satapathy, Biswajit ; Rao, K. Sreenivasa

Author_Institution

Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India

fYear

2013

fDate

13-15 Dec. 2013

Firstpage

1

Lastpage

6

Abstract

The basic goal of this work is to develop a Consonant-Vowel Recognition System (CVRS) for determining a sequence of Consonant-Vowel (CV) units present in a given speech utterance. In this work, we are focusing on developing CVRSs for Indian languages namely Bengali and Odia. This framework of developing CVRSs can be extended to any Indian languages. We have developed two separate CVRSs for Bengali and Odia languages. The CVRS is developed using read speech corpus. In this study, 67 CV classes for Bengali and 58 CV classes for Odia are used. Mel Frequency Cepstral Coefficients (MFCCs) are used as features for building models. The Vowel Onset Points (VOPs) are used as anchor points for marking syllable boundaries and for feature extraction. Support Vector Machines (SVMs) are used for building CV models. The performance of the developed CVRSs are evaluated in speaker dependent and speaker independent modes. In speaker dependent case, the best percentage accuracies of Bengali and Odia CVRSs are 49.48 and 69.66 respectively whereas in speaker independent case, the best percentage accuracies of Bengali and Odia CVRSs are 40.26 and 41.59 respectively.

Keywords

cepstral analysis; feature extraction; natural language processing; speech recognition; support vector machines; Bengali language; CV classes; CV models; CV unit sequence; CVRS development; CVRS performance evaluation; Indian languages; MFCC; Mel frequency cepstral coefficients; Odia language; SVMs; VOP; anchor points; consonant-vowel recognition system development; consonant-vowel unit sequence; feature extraction; read speech corpus; speaker dependent modes; speaker independent modes; speech utterance; support vector machines; syllable boundary marking; vowel onset points; Accuracy; Feature extraction; Hidden Markov models; Speech; Support vector machines; Training; Vectors; Consonant-Vowel recognition; International Phonetic Alphabet; Support Vector Machine; Syllable recognition; Vowel Onset Point;

fLanguage

English

Publisher

ieee

Conference_Titel

India Conference (INDICON), 2013 Annual IEEE

Conference_Location

Mumbai

Print_ISBN

978-1-4799-2274-1

Type

conf

DOI

10.1109/INDCON.2013.6726109

Filename

6726109