Title :
Speaker based Language Independent Isolated Speech Recognition System
Author :
Shanthi, Therese S. ; Lingam, Chelpa
Author_Institution :
Thadomal Shahani Eng. Coll., Bandra(W) Affiliated to Univ. of Mumbai, Mumbai, India
Abstract :
This paper presents a speaker based Language Independent Isolated Speech Recognition System (LIISRS). The most popular feature extraction technique Mel Frequency Cepstral Coefficients (MFCC) is used for training the system. Representative specific features are identified using K-Means algorithm. Distortion measure is calculated using Euclidian distance function. Pitch contour characteristics are used to identify the language specific features. Decision rules are formed to recognize language and speech of the given input. Thus, the proposed system not only recognizes the speech but also the language in which the speech is uttered. The result shows a satisfactory performance when the training is carried using native language speakers. Digits from one to ten of seven different languages are taken as training samples. Results obtained using 12 MFCC features for overall word level accuracy is 90.02% and language recognition accuracy is 97.14%.
Keywords :
feature extraction; natural language processing; speaker recognition; Euclidian distance function; LIISRS; MFCC features; decision rules; distortion measure; feature extraction technique; k-means algorithm; mel frequency cepstral coefficients; native language speakers; pitch contour characteristics; speaker based language independent isolated speech recognition system; Accuracy; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; Euclidean Distance; K-Means Algorithm; Mel Frequency Cepstral Coefficients (MFCC); Pitch Contour;
Conference_Titel :
Communication, Information & Computing Technology (ICCICT), 2015 International Conference on
Conference_Location :
Mumbai
Print_ISBN :
978-1-4799-5521-3
DOI :
10.1109/ICCICT.2015.7045748