• DocumentCode
    178384
  • Title

    Subband hybrid feature for multi-stream speech recognition

  • Author

    Feipeng Li

  • Author_Institution
    Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    2484
  • Lastpage
    2488
  • Abstract
    A subband hybrid (SBH) feature is developed for multi-stream (MS) speech recognition. The fullband speech signal is decomposed into multiple subbands, each covers about 3 Bark along the frequency. Speech signal is analyzed by a high-resolution filterbank of 4 filters/Bark and a low-resolution filterbank of 2 filters/Bark to facilitate the representation of both short-term spectral modulation and long-term temporal modulation within a frequency subband. Experiments on TIMIT corpus for English and RATS corpus for Arabic Levantine show that the SBH feature significantly enhances the amount of information being extracted from individual subbands. The MS system with performance monitor achieves a substantial gain in performance over the single-stream baseline.
  • Keywords
    channel bank filters; natural language processing; signal resolution; speech recognition; MS speech recognition; RATS corpus; SBH feature; TIMIT corpus; fullband speech signal; high-resolution filterbank; long-term temporal modulation; low-resolution filterbank; multistream speech recognition; short-term spectral modulation; subband hybrid feature; Frequency modulation; Monitoring; Noise; Rats; Speech; Speech recognition; multi-stream speech recognition; noise robustness; subband feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854047
  • Filename
    6854047