Title :
Automatic text independent language identification using reduct set of feature vectors
Author :
Sadanandam, M. ; Prasad, V. Kamakshi
Author_Institution :
CSE Kakatiya Univ., Warangal, India
Abstract :
In this paper, robust features are proposed for spoken language identification (LID) system. 12 Mel frequency cepstral coefficients (MFCCs) and five formant frequencies are extracted from each short-time windowed speech signal. These features are concatenated to form 17-dimensional feature vectors. 8-dimensional reduct set is obtained from this 17-dimensional feature vector using rough set theory. This 8-dimensional reduct set is transformed into 15 dimensional new feature vectors using the approach followed in [1]. In both the training and testing phases of LID, these 15 dimensional feature vectors are used. Due to usage of reduct set, there is a significant reduction of time in training and testing phases of the proposed LID system. The experiments are carried out on speech database of Indian languages and the results are impressive.
Keywords :
audio databases; cepstral analysis; feature extraction; natural language processing; rough set theory; speech processing; text analysis; 15 dimensional feature vectors; 17-dimensional feature vectors; 8-dimensional reduct set; Indian languages; LID system; MFCC; Mel frequency cepstral coefficients; automatic text independent language identification; feature vector reduct set; formant frequency extraction; rough set theory; short-time windowed speech signal; speech database; spoken language identification system; testing phase; training phase; Feature extraction; Mel frequency cepstral coefficient; Speech; Testing; Training; Vectors; Formants; LID; Language Identification; MFCC; Reduct set and new feature vectors;
Conference_Titel :
Fuzzy Systems (FUZZ), 2013 IEEE International Conference on
Conference_Location :
Hyderabad
Print_ISBN :
978-1-4799-0020-6
DOI :
10.1109/FUZZ-IEEE.2013.6622465