Dialectal Assamese vowel speech detection using acoustic phonetic features, KNN and RNN

Author

Sharma, Mridusmita ; Sarma, Kandarpa Kumar

Author_Institution

Dept. of Electron. & Commun. Eng., Gauhati Univ., Guwahati, India

fYear

2015

fDate

19-20 Feb. 2015

Firstpage

674

Lastpage

678

Abstract

The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /ε/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.

Keywords

natural languages; recurrent neural nets; signal classification; signal detection; speech recognition; Eastern dialect; Goalpariya dialect; KNN based algorithm; Kamrupi dialect; RNN based algorithm; acoustic phonetic features; central dialect; dialectal Assamese vowel speech detection; dialectal wise Assamese vowel recognition; feature vector; k-nearest neighbor based algorithm; recurrent neural network; speech processing; standard Assamese language; vowel phonemes; vowel sound recognition; Acoustics; Classification algorithms; Recurrent neural networks; Signal processing algorithms; Speech; Speech recognition; Training; ) Recognition; Acoustic Phonetic Features; Dialect; K-Nearest Neighbor (KNN; Recurrent Neural Network (RNN); Vowels;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on

Conference_Location

Noida

Print_ISBN

978-1-4799-5990-7

Type

conf

DOI

10.1109/SPIN.2015.7095270

Filename

7095270