DocumentCode :
1910205
Title :
Pitch Prediction from MFCC Vectors Using Support Vector Regression
Author :
Peng, Changping ; Liu, Wenjiu
Author_Institution :
Chinese Acad. of Sci., Beijing
fYear :
2007
fDate :
Aug. 30 2007-Sept. 1 2007
Firstpage :
342
Lastpage :
346
Abstract :
Mel-frequency cepstral coefficients (MFCC) are proved to be the effective feature for speech recognition and speaker recognition, while pitch frequency is also one of the favorite prosodic features. This work manages to bridge them with hidden Markov model (HMM) and support vector regression (SVR). A set of speaker-independent HMMs is used to align the training data, so that the parameters of the local SVR models can be trained from the pooled data with the same label. To evaluate the accuracy of the regression functions, MFCC streams are extracted from the test data, and the pitch frequencies are predicted from them with the trained mapping functions. The comparison between the predicted pitch contour and the reference one from PRAAT proves the strong bind of pitch frequency to MFCC vectors. And the HMM and the SVR are the proper models to characterize the connection.
Keywords :
feature extraction; hidden Markov models; learning (artificial intelligence); regression analysis; speaker recognition; support vector machines; mel-frequency cepstral coefficient vectors; pitch frequency prediction; prosodic feature extraction; speaker recognition; speaker-independent hidden Markov model; speech recognition; support vector regression; trained mapping function; Hidden Markov models; Laboratories; Mel frequency cepstral coefficient; Pattern recognition; Risk management; Speaker recognition; Speech recognition; Support vector machine classification; Support vector machines; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1611-0
Electronic_ISBN :
978-1-4244-1611-0
Type :
conf
DOI :
10.1109/NLPKE.2007.4368053
Filename :
4368053
Link To Document :
بازگشت