مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker adaptation using improved speaker Markov models

DocumentCode :

2022574

Title :

Speaker adaptation using improved speaker Markov models

Author :

Rigoll, Gerhard

Author_Institution :

NTT Human Interface Lab., Musashino-Shi, Tokyo, Japan

Volume :

fYear :

1993

fDate :

27-30 April 1993

Firstpage :

566

Abstract :

An attempt has been made to develop improved and more sophisticated SMMs (speaker Markov models) capable of modeling the acoustic differences between two speakers in a more accurate way, thus leading to improved recognition rates for the adapted speech recognition system. The original SSM approach has been improved by the introduction of the following three features: the use of fenonic speaker Markov models, the introduction of phoneme-dependent SMM parameters, and the use of special weighting between the short original training data of the new speaker and the adapted training data of the reference speaker. It was found that the phoneme recognition performance of these improved SMMs can be more than twice as high as the performance of the original SMM approach, which has already led to satisfying adaptation results for a large-vocabulary speech recognition task.<>

Keywords :

Markov processes; adaptive systems; learning (artificial intelligence); speech recognition; vocabulary; adapted training data; fenonic speaker Markov models; phoneme recognition performance; phoneme-dependent SMM parameters; speaker adaptation; speech recognition system; weighting;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on

Conference_Location :

Minneapolis, MN, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.1993.319370

Filename :

319370

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2022574