DocumentCode
2022564
Title
A new speaker adaptation technique using very short calibration speech
Author
Zhao, Yunxin
Author_Institution
Panasonic Technologies Inc., Santa Barbara, CA, USA
Volume
2
fYear
1993
fDate
27-30 April 1993
Firstpage
562
Abstract
A speaker adaptation technique based on the separation of speech spectra variation sources is developed for improving speaker-independent continuous speech recognition. The variation sources include speaker acoustic characteristics, phonologic characteristics, and contextual dependency of allophones. Statistical methods are formulated to normalize speech spectra based on speaker acoustic characteristics and then adapt mixture Gaussian density phone models based on speaker phonologic characteristics. Adaptation experiments using short calibration speech (5 s/speaker) have shown substantial performance improvement over the baseline recognition system. On a TIMIT test set, where the task vocabulary size is 853 and the test set perplexity is 104, the recognition word accuracy has been improved from 86.9% to 90.6% (28.2% error reduction). On a separate test set that contains an additional variation source of recording channel mismatch, with a test set perplexity of 101, the recognition word accuracy has been improved from 65.4% to 85.5% (58.1% error reduction).
Keywords
adaptive systems; calibration; speech recognition; vocabulary; TIMIT test set; calibration speech; contextual dependency of allophones; mixture Gaussian density phone models; performance; perplexity; phonologic characteristics; recognition word accuracy; speaker acoustic characteristics; speaker adaptation technique; speaker-independent continuous speech recognition
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location
Minneapolis, MN, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.1993.319369
Filename
319369