DocumentCode :
2083815
Title :
Text independent language recognition system for indic languages with new features
Author :
Sadanandam, M. ; Nagesh, A. ; Prasad, V. Kamakshi ; Janaki, V.
Author_Institution :
CSE, Kakatiya Univ., Warangal, India
fYear :
2012
fDate :
18-20 Dec. 2012
Firstpage :
1
Lastpage :
5
Abstract :
Spoken Language Identification is a task of identifying the language of an unknown utterance of speech. This paper describes a text independent language identification system using vector quantization with new features derived from MFCC feature of speech signal with a common code book. In this work, MFCC feature vectors of speech signal are transformed into new feature vectors. This LID approach includes generation of a common codebook using vector quantization with new feature set, one for each language. The experiments are carried out on Indian languages consists of six languages namely Tamil, Hindi, Tamil, Marathi, Malayalam and Kannada.
Keywords :
natural language processing; speech recognition; vector quantisation; Hindi language; Indian language; Indic language; Kannada language; MFCC feature; Malayalam language; Marathi language; Mel frequency cepstral coefficient; Tamil language; speech utterance; spoken language identification; text independent language recognition system; vector quantization; Common codebook; LID for Indian Languages; Language Identification (LID); new features; vector quantization (VQ);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence & Computing Research (ICCIC), 2012 IEEE International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4673-1342-1
Type :
conf
DOI :
10.1109/ICCIC.2012.6510219
Filename :
6510219
Link To Document :
بازگشت