DocumentCode :
2798679
Title :
Kalman filter based speech synthesis
Author :
Quillen, Carl
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4618
Lastpage :
4621
Abstract :
Preliminary results are reported from a very simple speech-synthesis system based on clustered-diphone Kalman Filter based modeling of line-spectral frequency based features. Parameters were estimated using maximum-likelihood EM training, with a constraint enforced that prevented eigenvalue magnitudes in the transition matrix from exceeding 1. Frames of training data were assigned diphone unit labels by forced alignment with an HMM recognition system. The HMM cluster tree was also used for Kalman Filter unit cluster assignments. The result is a simple synthesis system that has few parameters, synthesizes intelligible speech without audible discontinuities, and that can be adapted using MLLR techniques to support synthesis of a broad panoply of speakers from a single base model with small amounts of training data. The result is interesting for embedded synthesis applications.
Keywords :
Kalman filters; hidden Markov models; maximum likelihood estimation; speech synthesis; HMM recognition system; Kalman filter; MLLR techniques; embedded synthesis; maximum-likelihood EM training; parameter estimation; speech synthesis; Hidden Markov models; High temperature superconductors; Kalman filters; Laboratories; Maximum likelihood estimation; Maximum likelihood linear regression; Power system modeling; Speech recognition; Speech synthesis; Training data; Kalman filtering; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495547
Filename :
5495547
Link To Document :
بازگشت