Title :
An approach to content-independent feature extraction for Chinese-Korean spoken language identification
Author :
Lu Shi-Dan ; Cui Rong-Yi
Author_Institution :
Dept. of Comput. Sci. & Technol., Yanbian Univ., Yanji, China
Abstract :
This paper proposes a new classification feature extraction method for Chinese-Korean spoken language identification. First, the speech signal was divided into a sequence of frames and the number of frames was counted. Next, the ratio between the short-time zero-crossing rate and the short-time energy, i.e. the short-time-frequency-energy-ratio (STFER), was computed, and the mean STFER per frame was used as the classification feature for Chinese-Korean spoken language identification. Finally, the classification threshold was determined using information gain. Experimental results show that the proposed feature is simpler to compute than MFCC feature parameters, identifies the spoken language more accurately at lower complexity, and can be adopted in the preprocessing stage of language recognition.
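The pipeline described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the frame length, hop size, the small constant `eps`, the exact definitions of zero-crossing rate and energy, and all function names are assumptions chosen for the example. The threshold search simply picks the midpoint between sorted feature values that maximizes information gain over two class labels.

```python
import math


def frame_signal(samples, frame_len, hop):
    """Split a sample list into fixed-length frames (trailing remainder dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Sum of squared samples in the frame."""
    return sum(x * x for x in frame)


def mean_stfer(samples, frame_len=256, hop=128, eps=1e-12):
    """Mean over frames of (zero-crossing rate / energy).

    frame_len, hop and eps are assumed values, not taken from the paper;
    eps only guards against division by zero on silent frames.
    """
    frames = frame_signal(samples, frame_len, hop)
    if not frames:
        raise ValueError("signal is shorter than one frame")
    ratios = [zero_crossing_rate(f) / (short_time_energy(f) + eps) for f in frames]
    return sum(ratios) / len(frames)


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    h = 0.0
    for c in set(labels):
        p = labels.count(c) / n
        h -= p * math.log2(p)
    return h


def best_threshold(features, labels):
    """Return (threshold, gain) for the split `feature <= threshold`
    that maximizes information gain."""
    pairs = sorted(zip(features, labels))
    base = entropy(labels)
    best_t, best_gain = None, -1.0
    for (f0, _), (f1, _) in zip(pairs, pairs[1:]):
        if f0 == f1:
            continue  # no threshold can separate equal feature values
        t = (f0 + f1) / 2
        left = [lab for f, lab in pairs if f <= t]
        right = [lab for f, lab in pairs if f > t]
        weighted = (len(left) * entropy(left) + len(right) * entropy(right)) / len(pairs)
        gain = base - weighted
        if gain > best_gain:
            best_t, best_gain = t, gain
    return best_t, best_gain
```

In use, `mean_stfer` would be computed once per utterance, and `best_threshold` would be run on the resulting feature values of a labeled training set; a new utterance is then assigned to one language or the other depending on which side of the threshold its mean STFER falls. Which language lies on which side is determined by the training data and is not assumed here.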
Keywords :
feature extraction; natural language processing; pattern classification; speech processing; speech recognition; Chinese-Korean spoken language identification; classification feature extraction method; content-independent feature extraction; frame serial; language recognition; short-time energy; short-time zero-crossing rate; short-time-frequency-energy-ratio (STFER); Mel frequency cepstrum coefficient (MFCC); information gain
Conference_Title :
Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-8727-1
DOI :
10.1109/CSAE.2011.5952757