DocumentCode
704625
Title
Dialectal Assamese vowel speech detection using acoustic phonetic features, KNN and RNN
Author
Sharma, Mridusmita ; Sarma, Kandarpa Kumar
Author_Institution
Dept. of Electron. & Commun. Eng., Gauhati Univ., Guwahati, India
fYear
2015
fDate
19-20 Feb. 2015
Firstpage
674
Lastpage
678
Abstract
The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /ε/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.
Keywords
natural languages; recurrent neural nets; signal classification; signal detection; speech recognition; Eastern dialect; Goalpariya dialect; KNN based algorithm; Kamrupi dialect; RNN based algorithm; acoustic phonetic features; central dialect; dialectal Assamese vowel speech detection; dialectal wise Assamese vowel recognition; feature vector; k-nearest neighbor based algorithm; recurrent neural network; speech processing; standard Assamese language; vowel phonemes; vowel sound recognition; Acoustics; Classification algorithms; Recurrent neural networks; Signal processing algorithms; Speech; Speech recognition; Training; ) Recognition; Acoustic Phonetic Features; Dialect; K-Nearest Neighbor (KNN; Recurrent Neural Network (RNN); Vowels;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-5990-7
Type
conf
DOI
10.1109/SPIN.2015.7095270
Filename
7095270
Link To Document