• DocumentCode
    1550144
  • Title

    New approaches for domain transformation and parameter combination for improved accuracy in parallel model combination (PMC) techniques

  • Author

    Hung, Jeih-weih ; Shen, Jia-Lin ; Lee, Lin-shan

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    9
  • Issue
    8
  • fYear
    2001
  • fDate
    11/1/2001 12:00:00 AM
  • Firstpage
    842
  • Lastpage
    855
  • Abstract
    Parallel model combination (PMC) techniques have been very successful and popularly used in many applications to improve the performance of speech recognition systems under noisy environments. However, it is believed that some assumptions and approximations made in this approach, primarily in the domain transformation and parameter combination processes, are not necessarily accurate enough in certain practical situations, which may degrade the achievable performance of PMC. In this paper, the possible sources that cause the performance degradation in these processes are carefully analyzed and discussed. Three new approaches, including the truncated Gaussian approach and the split mixture approach for the domain transformation process and the estimated cross-term approach for parameter combination process, are proposed in this paper in order to handle these problems, minimize such degradation, and improve the accuracy of the PMC techniques. These proposed approaches were analyzed and discussed with two recognition tasks, one relatively simple, and the other more complicated and realistic. Both sets of experiments showed that these proposed approaches are able to provide significant improvements over the original PMC method, especially when the SNR condition is worse
  • Keywords
    Gaussian processes; hidden Markov models; speech recognition; transforms; HMM; PMC techniques; domain transformation; estimated cross-term approach; hidden Markov models; noisy environments; parallel model combination techniques; parameter combination; speech recognition systems; split mixture approach; truncated Gaussian approach; Additive noise; Cepstral analysis; Character recognition; Degradation; Loudspeakers; Maximum likelihood linear regression; Noise robustness; Performance analysis; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.966087
  • Filename
    966087