DocumentCode :
1658042
Title :
Enhanced spectral features for spoken language identification
Author :
Ziaei, Ali ; Ahadi, Seyed Mohammad ; Yeganeh, Hojatollah
Author_Institution :
Electr. Eng. Dept., Amirkabir Univ. of Technol., Tehran
fYear :
2008
Firstpage :
1495
Lastpage :
1498
Abstract :
To date, systems for the identification of spoken languages have normally used spectral features such as the MFCC. It has also been shown that prosody features such as pitch and intensity have an important role to increase the accuracy of LID. In this paper, we used three novel features based on spectrum, in combination with MFCC and prosody features to improve language identification accuracy. These features are spectral centroid, Renyi entropy and Shannon entropy. The basic system used is GMM-LM with back-end classifier. Also we used a new method to convert the output of the language model scores to the new vectors to increase the LID performance. Using three new features and LM score conversion, an improved accuracy of about 8%, in comparison to the baseline system, on five of the languages of OGI-TS multilingual telephone speech corpus, was obtained.
Keywords :
feature extraction; speech processing; Renyi entropy; Shannon entropy; back-end classifier; language identification accuracy; multilingual telephone speech corpus; spectral centroid; spectral feature enhancement; spoken language identification; Cepstral analysis; Entropy; Fingerprint recognition; Laboratories; Linear discriminant analysis; Mel frequency cepstral coefficient; Natural languages; Probability distribution; Speech processing; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2178-7
Electronic_ISBN :
978-1-4244-2179-4
Type :
conf
DOI :
10.1109/ICOSP.2008.4697416
Filename :
4697416
Link To Document :
بازگشت