DocumentCode
2021310
Title
Multi-layered features with SVM for Chinese accent identification
Author
Hou, Jue ; Liu, Yi ; Zheng, Thomas Fang ; Olsen, Jesper ; Tian, Jilei
Author_Institution
Div. of Technol. Innovation & Dev., Tsinghua Nat. Lab. for Inf. Sci. & Technol., Beijing, China
fYear
2010
fDate
23-25 Nov. 2010
Firstpage
25
Lastpage
30
Abstract
In this paper, we propose an approach of multi-layered feature combination associated with support vector machine (SVM) for Chinese accent identification. The multi-layered features include both segmental and suprasegmental information, such as MFCC and pitch contour, to capture the diversity of variations in Chinese accented speech. The pitch contour is estimated using cubic polynomial method to model the variant characters in different accents in Chinese. We train two GMM acoustic models in order to express the features of a certain accent. As the original criterion of the GMM model cannot deal with such multi-layered features, the SVM is utilized to make the decision. The effectiveness of the proposed approach was evaluated on the 863 Chinese accent corpus. Our approach yields a significant 10% relative error rate reduction compared with traditional approaches using sole feature at single level in Chinese accented speech identification.
Keywords
Gaussian processes; polynomials; speaker recognition; support vector machines; Chinese accented speech; Chinese accented speech identification; GMM acoustic models; MFCC; SVM; cubic polynomial method; multilayered features; pitch contour; relative error rate reduction; support vector machine; Accuracy; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Support vector machines; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio Language and Image Processing (ICALIP), 2010 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-5856-1
Type
conf
DOI
10.1109/ICALIP.2010.5685023
Filename
5685023
Link To Document