Title :
Pitch Prediction from MFCC Vectors Using Support Vector Regression
Author :
Peng, Changping ; Liu, Wenjiu
Author_Institution :
Chinese Acad. of Sci., Beijing
fDate :
Aug. 30 2007-Sept. 1 2007
Abstract :
Mel-frequency cepstral coefficients (MFCC) are proved to be the effective feature for speech recognition and speaker recognition, while pitch frequency is also one of the favorite prosodic features. This work manages to bridge them with hidden Markov model (HMM) and support vector regression (SVR). A set of speaker-independent HMMs is used to align the training data, so that the parameters of the local SVR models can be trained from the pooled data with the same label. To evaluate the accuracy of the regression functions, MFCC streams are extracted from the test data, and the pitch frequencies are predicted from them with the trained mapping functions. The comparison between the predicted pitch contour and the reference one from PRAAT proves the strong bind of pitch frequency to MFCC vectors. And the HMM and the SVR are the proper models to characterize the connection.
Keywords :
feature extraction; hidden Markov models; learning (artificial intelligence); regression analysis; speaker recognition; support vector machines; mel-frequency cepstral coefficient vectors; pitch frequency prediction; prosodic feature extraction; speaker recognition; speaker-independent hidden Markov model; speech recognition; support vector regression; trained mapping function; Hidden Markov models; Laboratories; Mel frequency cepstral coefficient; Pattern recognition; Risk management; Speaker recognition; Speech recognition; Support vector machine classification; Support vector machines; Training data;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1611-0
Electronic_ISBN :
978-1-4244-1611-0
DOI :
10.1109/NLPKE.2007.4368053