DocumentCode
1910205
Title
Pitch Prediction from MFCC Vectors Using Support Vector Regression
Author
Peng, Changping ; Liu, Wenjiu
Author_Institution
Chinese Acad. of Sci., Beijing
fYear
2007
fDate
Aug. 30 2007-Sept. 1 2007
Firstpage
342
Lastpage
346
Abstract
Mel-frequency cepstral coefficients (MFCC) are proved to be the effective feature for speech recognition and speaker recognition, while pitch frequency is also one of the favorite prosodic features. This work manages to bridge them with hidden Markov model (HMM) and support vector regression (SVR). A set of speaker-independent HMMs is used to align the training data, so that the parameters of the local SVR models can be trained from the pooled data with the same label. To evaluate the accuracy of the regression functions, MFCC streams are extracted from the test data, and the pitch frequencies are predicted from them with the trained mapping functions. The comparison between the predicted pitch contour and the reference one from PRAAT proves the strong bind of pitch frequency to MFCC vectors. And the HMM and the SVR are the proper models to characterize the connection.
Keywords
feature extraction; hidden Markov models; learning (artificial intelligence); regression analysis; speaker recognition; support vector machines; mel-frequency cepstral coefficient vectors; pitch frequency prediction; prosodic feature extraction; speaker recognition; speaker-independent hidden Markov model; speech recognition; support vector regression; trained mapping function; Hidden Markov models; Laboratories; Mel frequency cepstral coefficient; Pattern recognition; Risk management; Speaker recognition; Speech recognition; Support vector machine classification; Support vector machines; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-1611-0
Electronic_ISBN
978-1-4244-1611-0
Type
conf
DOI
10.1109/NLPKE.2007.4368053
Filename
4368053
Link To Document