Title :
Multi-layered features with SVM for Chinese accent identification
Author :
Hou, Jue ; Liu, Yi ; Zheng, Thomas Fang ; Olsen, Jesper ; Tian, Jilei
Author_Institution :
Div. of Technol. Innovation & Dev., Tsinghua Nat. Lab. for Inf. Sci. & Technol., Beijing, China
Abstract :
In this paper, we propose an approach of multi-layered feature combination associated with support vector machine (SVM) for Chinese accent identification. The multi-layered features include both segmental and suprasegmental information, such as MFCC and pitch contour, to capture the diversity of variations in Chinese accented speech. The pitch contour is estimated using cubic polynomial method to model the variant characters in different accents in Chinese. We train two GMM acoustic models in order to express the features of a certain accent. As the original criterion of the GMM model cannot deal with such multi-layered features, the SVM is utilized to make the decision. The effectiveness of the proposed approach was evaluated on the 863 Chinese accent corpus. Our approach yields a significant 10% relative error rate reduction compared with traditional approaches using sole feature at single level in Chinese accented speech identification.
Keywords :
Gaussian processes; polynomials; speaker recognition; support vector machines; Chinese accented speech; Chinese accented speech identification; GMM acoustic models; MFCC; SVM; cubic polynomial method; multilayered features; pitch contour; relative error rate reduction; support vector machine; Accuracy; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Support vector machines; Training;
Conference_Titel :
Audio Language and Image Processing (ICALIP), 2010 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-5856-1
DOI :
10.1109/ICALIP.2010.5685023