DocumentCode
547378
Title
An approach to content-independent feature extraction for Chinese-Korean spoken language identification
Author
Lu Shi-Dan ; Cui Rong-Yi
Author_Institution
Dept. of Comput. Sci. & Technol., Yanbian Univ., Yanji, China
Volume
3
fYear
2011
fDate
10-12 June 2011
Firstpage
638
Lastpage
641
Abstract
A new classification feature extraction method for Chinese-Korean spoken language identification was proposed in this paper. Firstly, speech signal was divided into frame serial and the number of frames was counted. Furthermore, the ratio between short-time zero-crossing rate and short-time energy, i.e. short-time-frequency-energy-ratio (STFER), was computed, and the mean STFER per frame was treated as the classification feature to implement Chinese-Korean spoken language identification. Finally, the classification threshold was determined using information gain. Experimental results show that the proposed method is simpler than MFCC feature parameters and has better ability to identify spoken language with lower complexity, can be adopted in preprocessing procedure of language recognition.
Keywords
feature extraction; natural language processing; pattern classification; speech processing; speech recognition; Chinese-Korean spoken language identification; classification feature extraction method; content-independent feature extraction; frame serial; language recognition; short-time energy; short-time zero-crossing rate; short-time-frequency-energy-ratio; Mel frequency cepstrum coefficient (MFCC); information gain; short-time energy; short-time zero-crossing rate; short-time-frequency-energy-ratio (STFER);
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-8727-1
Type
conf
DOI
10.1109/CSAE.2011.5952757
Filename
5952757
Link To Document