DocumentCode :
3015125
Title :
Rapid speaker adaptation using a probabilistic spectral mapping
Author :
Schwartz, Richard ; Chow, Yen-Lu ; Kubala, Francis
Author_Institution :
BBN Laboratories, Cambridge, MA
Volume :
12
fYear :
1987
fDate :
31868
Firstpage :
633
Lastpage :
636
Abstract :
This paper deals with rapid speaker adaptation for speech recognition. We introduce a new algorithm that transforms hidden Markov models of speech derived from one "prototype" speaker so that they model the speech of a new speaker. The Speaker normalization is accomplished by a probabilistic spectral mapping from one speaker to another. For a 350 word task with a grammar and using only 15 seconds of speech for normalization, the recognition accuracy is 97% averaged over 6 speakers. This accuracy would normally require over 5 minutes of speaker dependent training. We derive the probabilistic spectral transformation of HMMs, describe an algorithm to estimate the transformation, and present recognition results.
Keywords :
Context modeling; Degradation; Hidden Markov models; Laboratories; Probability density function; Prototypes; Robustness; Speech recognition; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
Type :
conf
DOI :
10.1109/ICASSP.1987.1169575
Filename :
1169575
Link To Document :
بازگشت