DocumentCode :
2022574
Title :
Speaker adaptation using improved speaker Markov models
Author :
Rigoll, Gerhard
Author_Institution :
NTT Human Interface Lab., Musashino-Shi, Tokyo, Japan
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
566
Abstract :
An attempt has been made to develop improved and more sophisticated SMMs (speaker Markov models) capable of modeling the acoustic differences between two speakers in a more accurate way, thus leading to improved recognition rates for the adapted speech recognition system. The original SSM approach has been improved by the introduction of the following three features: the use of fenonic speaker Markov models, the introduction of phoneme-dependent SMM parameters, and the use of special weighting between the short original training data of the new speaker and the adapted training data of the reference speaker. It was found that the phoneme recognition performance of these improved SMMs can be more than twice as high as the performance of the original SMM approach, which has already led to satisfying adaptation results for a large-vocabulary speech recognition task.<>
Keywords :
Markov processes; adaptive systems; learning (artificial intelligence); speech recognition; vocabulary; adapted training data; fenonic speaker Markov models; phoneme recognition performance; phoneme-dependent SMM parameters; speaker adaptation; speech recognition system; weighting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319370
Filename :
319370
Link To Document :
بازگشت