DocumentCode :
1550144
Title :
New approaches for domain transformation and parameter combination for improved accuracy in parallel model combination (PMC) techniques
Author :
Hung, Jeih-weih ; Shen, Jia-Lin ; Lee, Lin-shan
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
9
Issue :
8
fYear :
2001
fDate :
11/1/2001 12:00:00 AM
Firstpage :
842
Lastpage :
855
Abstract :
Parallel model combination (PMC) techniques have been very successful and popularly used in many applications to improve the performance of speech recognition systems under noisy environments. However, it is believed that some assumptions and approximations made in this approach, primarily in the domain transformation and parameter combination processes, are not necessarily accurate enough in certain practical situations, which may degrade the achievable performance of PMC. In this paper, the possible sources that cause the performance degradation in these processes are carefully analyzed and discussed. Three new approaches, including the truncated Gaussian approach and the split mixture approach for the domain transformation process and the estimated cross-term approach for parameter combination process, are proposed in this paper in order to handle these problems, minimize such degradation, and improve the accuracy of the PMC techniques. These proposed approaches were analyzed and discussed with two recognition tasks, one relatively simple, and the other more complicated and realistic. Both sets of experiments showed that these proposed approaches are able to provide significant improvements over the original PMC method, especially when the SNR condition is worse
Keywords :
Gaussian processes; hidden Markov models; speech recognition; transforms; HMM; PMC techniques; domain transformation; estimated cross-term approach; hidden Markov models; noisy environments; parallel model combination techniques; parameter combination; speech recognition systems; split mixture approach; truncated Gaussian approach; Additive noise; Cepstral analysis; Character recognition; Degradation; Loudspeakers; Maximum likelihood linear regression; Noise robustness; Performance analysis; Speech recognition; Working environment noise;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.966087
Filename :
966087
Link To Document :
بازگشت