• DocumentCode
    535046
  • Title

    A multi-stream speech recognition system based on the estimation of stream weights

  • Author

    Guo, Hongyu ; Chen, Qinghai ; Huang, Dongmei ; Guo, Hongyu ; Zhao, Xiaoqun

  • Author_Institution
    Sch. of Inf., Shanghai Ocean Univ., Shanghai, China
  • Volume
    7
  • fYear
    2010
  • fDate
    16-18 Oct. 2010
  • Firstpage
    3479
  • Lastpage
    3482
  • Abstract
    A multi-stream speech recognition framework based on the estimation of stream weights is proposed for robust speech recognition. First, two complementary acoustic features, MFCCs and LPCCs, were selected. Second, we modeled them separately by using Hidden Markov Models (HMMs), furthermore, formed two streams of this system. Last, we combined the likelihood outputs of the above two systems with weighting technique and obtained a better performance. Here we present a novel algorithm for computing the stream weights of the two feature streams based on the computation of intra-and inter-class distances. Experimental results obtained on Chinese Academy of Science speech database show that this system yields better recognition performance in all conditions. Using this multi-stream framework, we found that the word error rate was decreased by 5%.
  • Keywords
    hidden Markov models; speech recognition; complementary acoustic features; hidden Markov models; multi-stream speech recognition system; speech database; stream weights estimation; Computational modeling; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; hidden markov models; linear predictive cepstral coefficient; mel-frequency cepstral coefficient; multi-stream framework; speech recognition; stream weights;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image and Signal Processing (CISP), 2010 3rd International Congress on
  • Conference_Location
    Yantai
  • Print_ISBN
    978-1-4244-6513-2
  • Type

    conf

  • DOI
    10.1109/CISP.2010.5646753
  • Filename
    5646753