DocumentCode :
187638
Title :
Sub-segmental, segmental and supra-segmental analysis of linear prediction residual signal for language identification
Author :
Nandi, Dipanjan ; Pati, Debadatta ; Rao, K. Sreenivasa
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol., Kharagpur, Kharagpur, India
fYear :
2014
fDate :
22-25 July 2014
Firstpage :
1
Lastpage :
6
Abstract :
In this work, excitation source information is explored for language identification (LID) task. The excitation signal is represented by linear prediction (LP) residual. Different aspects of the excitation source information can be captured by processing LP residual signal at sub-segmental, segmental and supra-segmental levels. Gaussian mixture modelling (GMM) technique is used to build the language models. Present LID study has been carried out on IITKGP-MLILSC speech database. Individually, the segmental level information provides good LID accuracy followed by sub-segmental and supra-segmental level information. Combined evidences from all three levels represent the complete excitation source information. Finally, a comparative study has been carried out between the vocal tract and excitation source features, which portrays the distinct nature of these two features. Combination of both the features, yield an improvement of 10.01% in LID accuracy than only excitation source information. This observation indicates the significance of excitation source information for LID task.
Keywords :
Gaussian processes; mixture models; natural language processing; prediction theory; speech processing; GMM; Gaussian mixture modelling; IITKGP-MLILSC speech database; LID task; LP residual signal; excitation signal; excitation source features; excitation source information; language identification; language models; linear prediction residual signal; subsegmental analysis; subsegmental level information; supra-segmental analysis; supra-segmental level information; vocal tract; Accuracy; Correlation; Feature extraction; Mel frequency cepstral coefficient; Production; Speech; IITKGP-MLILSC; LP residual; MFCC; segmental; sub-segmental; suprasegmental;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications (SPCOM), 2014 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-4666-2
Type :
conf
DOI :
10.1109/SPCOM.2014.6983974
Filename :
6983974
Link To Document :
بازگشت