• DocumentCode
    447207
  • Title

    A two-dimensional robust algorithm based on Mel-bank log-spectrum

  • Author

    Wang, Yizhou ; Wu, Ji ; Wang, Zuoying

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • Volume
    1
  • fYear
    2005
  • fDate
    12-14 Oct. 2005
  • Firstpage
    759
  • Lastpage
    763
  • Abstract
    In this paper, a novel technique named two-dimensional robust algorithm based on Mel-bank log-spectrum is presented which combines sub-band average log-spectrum enhancement algorithm and peak isolation algorithm in log-spectrum domain. Information on the long envelopes of time trajectories of log-spectral energies and robustness of the log-spectral peaks in frequency bands are used together to achieve more excellent performance in this algorithm. Comparative experiments on the Aurora 2 databases are performed between these techniques and the proposed two-dimensional algorithm. It is shown that this combined two-dimensional algorithm results in better recognition performance than each technique. Compared to the Aurora 2.0 baseline system, the word error rate reduction was 21.23% in the clean training case for this combined algorithm.
  • Keywords
    speech enhancement; speech recognition; Mel-bank log-spectrum; peak isolation algorithm; subband average log-spectrum enhancement algorithm; two-dimensional robust algorithm; Acoustic noise; Automatic speech recognition; Filtering; Frequency domain analysis; Robustness; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing; Wiener filter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on
  • Print_ISBN
    0-7803-9538-7
  • Type

    conf

  • DOI
    10.1109/ISCIT.2005.1566964
  • Filename
    1566964