Title :
Indian language identification using k-means clustering and support vector machine (SVM)
Author :
Verma, V.K. ; Khanna, Neha
Author_Institution :
Dept. of Electr. & Electron. Eng., Graphic Era Univ., Dehradun, India
Abstract :
Audio speech signal contains various important information regarding language spoken, speaker identification, emotion recognition, gender recognition and the phonetic information about the speech being spoken, etc. This paper presents an automatic language identification system using K-means clustering on Mel-Frequency Cepstral Coefficients (MFCCs) for feature extraction and Support Vector Machine (SVM) for classification. Use of K-means clustering for post-processing MFCC features before sending them to the classifier allows considerable reduction in the complexity of SVM classifier, which is otherwise unavoidable due to huge number of MFCC features from each speech signal. The performance of the proposed system is tested on a custom speech database of three Indian languages: English, Hindi, and Tibetian. The proposed system shows promising results with an average classification accuracy of 81% using small duration speech signals.
Keywords :
cepstral analysis; computational complexity; emotion recognition; feature extraction; natural language processing; pattern clustering; signal classification; speaker recognition; speech processing; support vector machines; English; Hindi; Indian language identification; MFCC features; SVM classifier; Tibetian; audio speech signal; automatic language identification system; complexity reduction; emotion recognition; feature extraction; gender recognition; k-means clustering; language spoken; mel-frequency cepstral coefficients; phonetic information; speaker identification; speech database; support vector machine; Accuracy; Databases; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Support vector machines; Automatic Language Identification (LID); Indian language identification; K-means clustering; Mel-Frequency Cepstral Coefficient (MFCC); Support Vector Machine (SVM);
Conference_Titel :
Engineering and Systems (SCES), 2013 Students Conference on
Conference_Location :
Allahabad
Print_ISBN :
978-1-4673-5628-2
DOI :
10.1109/SCES.2013.6547533