• DocumentCode
    177989
  • Title

    A newem estimationof dynamic stream weights for coupled-HMM-based audio-visual ASR

  • Author

    Abdelaziz, Ahmed Hussen ; Zeiler, Steffen ; Kolossa, Dorothea

  • Author_Institution
    Digital Signal Process. Group, Ruhr-Univ. Bochum, Bochum, Germany
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    1527
  • Lastpage
    1531
  • Abstract
    Mutually deploying visual and acoustical information in automatic speech recognition systems increases their robustness against acoustical environmental effects like additive noise and reverberation. Optimal fusion of the audio and video streams requires dynamic adaptation of the relative contribution of each modality. This can be achieved by weighting each stream according to its reliability by an appropriate stream weight. In this paper we propose a new expectation maximization algorithm that estimates oracle frame-dependent stream weights for coupled-HMM-based audio-visual speech recognition. Moreover, we introduce a greedy optimization approach that reasonably initializes this algorithm. The proposed approach is evaluated on the Grid audio-visual database and results in an average relative word error rate reduction of 38% and 58% compared to grid search and Bayes fusion, respectively. The estimated oracle stream weights can be used instead of the conventional global fixed stream weights to improve the supervised training of stream weight estimators.
  • Keywords
    expectation-maximisation algorithm; optimisation; speech recognition; EM estimation; audio streams; automatic speech recognition systems; coupled HMM based audio visual ASR; dynamic stream weights; expectation maximization algorithm; greedy optimization approach; optimal fusion; oracle frame dependent stream weights; video streams; Acoustics; Conferences; Decision support systems; Speech; Speech processing; AVASR; CHMM; Stream weight;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853853
  • Filename
    6853853