DocumentCode
1550144
Title
New approaches for domain transformation and parameter combination for improved accuracy in parallel model combination (PMC) techniques
Author
Hung, Jeih-weih ; Shen, Jia-Lin ; Lee, Lin-shan
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
9
Issue
8
fYear
2001
fDate
11/1/2001 12:00:00 AM
Firstpage
842
Lastpage
855
Abstract
Parallel model combination (PMC) techniques have been very successful and popularly used in many applications to improve the performance of speech recognition systems under noisy environments. However, it is believed that some assumptions and approximations made in this approach, primarily in the domain transformation and parameter combination processes, are not necessarily accurate enough in certain practical situations, which may degrade the achievable performance of PMC. In this paper, the possible sources that cause the performance degradation in these processes are carefully analyzed and discussed. Three new approaches, including the truncated Gaussian approach and the split mixture approach for the domain transformation process and the estimated cross-term approach for parameter combination process, are proposed in this paper in order to handle these problems, minimize such degradation, and improve the accuracy of the PMC techniques. These proposed approaches were analyzed and discussed with two recognition tasks, one relatively simple, and the other more complicated and realistic. Both sets of experiments showed that these proposed approaches are able to provide significant improvements over the original PMC method, especially when the SNR condition is worse
Keywords
Gaussian processes; hidden Markov models; speech recognition; transforms; HMM; PMC techniques; domain transformation; estimated cross-term approach; hidden Markov models; noisy environments; parallel model combination techniques; parameter combination; speech recognition systems; split mixture approach; truncated Gaussian approach; Additive noise; Cepstral analysis; Character recognition; Degradation; Loudspeakers; Maximum likelihood linear regression; Noise robustness; Performance analysis; Speech recognition; Working environment noise;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.966087
Filename
966087
Link To Document