An approach to content-independent feature extraction for Chinese-Korean spoken language identification

Author

Lu Shi-Dan ; Cui Rong-Yi

Author_Institution

Dept. of Comput. Sci. & Technol., Yanbian Univ., Yanji, China

Volume

3

fYear

2011

fDate

10-12 June 2011

Firstpage

638

Lastpage

641

Abstract

A new classification feature extraction method for Chinese-Korean spoken language identification was proposed in this paper. Firstly, speech signal was divided into frame serial and the number of frames was counted. Furthermore, the ratio between short-time zero-crossing rate and short-time energy, i.e. short-time-frequency-energy-ratio (STFER), was computed, and the mean STFER per frame was treated as the classification feature to implement Chinese-Korean spoken language identification. Finally, the classification threshold was determined using information gain. Experimental results show that the proposed method is simpler than MFCC feature parameters and has better ability to identify spoken language with lower complexity, can be adopted in preprocessing procedure of language recognition.

Keywords

feature extraction; natural language processing; pattern classification; speech processing; speech recognition; Chinese-Korean spoken language identification; classification feature extraction method; content-independent feature extraction; frame serial; language recognition; short-time energy; short-time zero-crossing rate; short-time-frequency-energy-ratio; Mel frequency cepstrum coefficient (MFCC); information gain; short-time energy; short-time zero-crossing rate; short-time-frequency-energy-ratio (STFER);

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on

Conference_Location

Shanghai

Print_ISBN

978-1-4244-8727-1

Type

conf

DOI

10.1109/CSAE.2011.5952757

Filename

5952757