• DocumentCode
    1761008
  • Title

    Optimisation of multiple feature stream weights for distributed speech processing in mobile environments

  • Author

    Djamel, Addou ; Sid-Ahmed, Selouani

  • Author_Institution
    Fac. of Electron. & Comput. Sci., USTHB, Algiers, Algeria
  • Volume
    9
  • Issue
    4
  • fYear
    2015
  • fDate
    6 2015
  • Firstpage
    387
  • Lastpage
    394
  • Abstract
    Mobile environments are highly influenced by ambient noise that can cause a significant deterioration in speech recognition performance. In this study, a new framework integrating a noise-robust frontend (FE) in distributed speech recognition (DSR) is presented. Using the Aurora-2 speech database, the authors evaluate the impact of the proposed multidimensional acoustical analysis on the performance of the Mel-frequency-based European Telecommunications Standards Institute-advanced FE (AFE) combined with the Mel-line spectral frequencies (MLSFs) robust features for highly noisy speech. The stream weights of the resulting multi-stream hidden Markov models are optimised automatically by deploying a novel approach based on a discriminative model combination. Finally, these features are effectively transformed and reduced using the Karhunen-Loève transform. The proposed MLSF-based FE (MLSF-FE) is shown to exhibit a reduction in the relative error rate. Moreover, the proposed FE provides comparable recognition performance to the current DSR-AFE available in global system of mobile communications.
  • Keywords
    Karhunen-Loeve transforms; cellular radio; hidden Markov models; speech recognition; Aurora-2 speech database; DSR; Karhunen-Loeve transform; MLSF robust features; MLSF-FE; MLSF-based FE; Mel-frequency-based European Telecommunications Standards Institute-advanced FE; Mel-line spectral frequencies; ambient noise; discriminative model combination; distributed speech processing; distributed speech recognition; global system of mobile communications; highly-noisy speech; mobile environments; multidimensional acoustical analysis; multiple-feature stream weight optimisation; multistream hidden Markov model; noise-robust FE; noise-robust frontend; relative error rate; speech recognition performance;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IET
  • Publisher
    iet
  • ISSN
    1751-9675
  • Type

    jour

  • DOI
    10.1049/iet-spr.2013.0392
  • Filename
    7122408