Title :
Rapid speaker adaptation using a probabilistic spectral mapping
Author :
Schwartz, Richard ; Chow, Yen-Lu ; Kubala, Francis
Author_Institution :
BBN Laboratories, Cambridge, MA
Abstract :
This paper deals with rapid speaker adaptation for speech recognition. We introduce a new algorithm that transforms hidden Markov models of speech derived from one "prototype" speaker so that they model the speech of a new speaker. The Speaker normalization is accomplished by a probabilistic spectral mapping from one speaker to another. For a 350 word task with a grammar and using only 15 seconds of speech for normalization, the recognition accuracy is 97% averaged over 6 speakers. This accuracy would normally require over 5 minutes of speaker dependent training. We derive the probabilistic spectral transformation of HMMs, describe an algorithm to estimate the transformation, and present recognition results.
Keywords :
Context modeling; Degradation; Hidden Markov models; Laboratories; Probability density function; Prototypes; Robustness; Speech recognition; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '87.
DOI :
10.1109/ICASSP.1987.1169575