• DocumentCode
    704625
  • Title

    Dialectal Assamese vowel speech detection using acoustic phonetic features, KNN and RNN

  • Author

    Sharma, Mridusmita ; Sarma, Kandarpa Kumar

  • Author_Institution
    Dept. of Electron. & Commun. Eng., Gauhati Univ., Guwahati, India
  • fYear
    2015
  • fDate
    19-20 Feb. 2015
  • Firstpage
    674
  • Lastpage
    678
  • Abstract
    The recognition of vowel phonemes plays an important role in the field of speech processing. Assamese is the major language of Assam and also the mother-tongue of the of the largest segment of the population of Assam. The standard Assamese language has four major dialects namely Central dialect, Eastern dialect, Goalpariya dialect and Kamrupi dialect. It has eight vowel phonemes which are /i/, /e/, /ε/, /a/, /n/, /c/, /o/ and /u/. In this paper, a comparative analysis between the Recurrent Neural Network (RNN) based algorithm and K-Nearest Neighbor (KNN) based algorithm is carried out for the recognition of the vowel sounds using the Acoustic Phonetic Features as the feature vector. Dialect wise recognition of the vowels is also carried out using the same feature vectors. A recognition rate of 97 % is obtained by using the KNN based algorithm for vowel recognition and an overall rate of 84.3% and 87% is obtained by RNN based algorithm and KNN based algorithm respectively for the dialectal Assamese vowel recognition. K-NN based approach gives better recognition rate than the ANN based approach.
  • Keywords
    natural languages; recurrent neural nets; signal classification; signal detection; speech recognition; Eastern dialect; Goalpariya dialect; KNN based algorithm; Kamrupi dialect; RNN based algorithm; acoustic phonetic features; central dialect; dialectal Assamese vowel speech detection; dialectal wise Assamese vowel recognition; feature vector; k-nearest neighbor based algorithm; recurrent neural network; speech processing; standard Assamese language; vowel phonemes; vowel sound recognition; Acoustics; Classification algorithms; Recurrent neural networks; Signal processing algorithms; Speech; Speech recognition; Training; ) Recognition; Acoustic Phonetic Features; Dialect; K-Nearest Neighbor (KNN; Recurrent Neural Network (RNN); Vowels;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Integrated Networks (SPIN), 2015 2nd International Conference on
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-5990-7
  • Type

    conf

  • DOI
    10.1109/SPIN.2015.7095270
  • Filename
    7095270