• DocumentCode
    3530540
  • Title
    Speech enhancement based on joint time-frequency segmentation
  • Author
    Tantibundhit, C. ; Pernkopf, F. ; Kubin, G.
  • Author_Institution
    MedIntelligence & Innovation Lab., Thammasat Univ., Bangkok
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4673
  • Lastpage
    4676
  • Abstract
    We present an algorithm to decompose speech into transient and non-transient components. Our algorithm, the joint time-frequency segmentation algorithm, uses the wavelet packet coefficients of the speech signal and represents them as tiles of a time-frequency representation adapted to the characteristics of the signal itself. Any wavelet packet coefficient whose tiling height is larger than or equal to its tiling width is characterized as a transient coefficient; all other coefficients are characterized as non-transient. The transient component is selectively amplified and recombined with the original speech to generate the modified speech, whose energy is adjusted to equal that of the original speech. Psychoacoustic tests performed with fourteen human listeners show that the speech modification significantly improves speech intelligibility in background noise, by 10% absolute at 0 dB to 31% absolute at -30 dB.
  • Keywords
    signal representation; speech enhancement; speech intelligibility; time-frequency analysis; wavelet transforms; background noise; human listeners; joint time-frequency segmentation; non-transient coefficient; psychoacoustic tests; speech modification; speech signal; time-frequency representation; wavelet packet coefficients; Band pass filters; Noise cancellation; Psychology; Signal processing algorithms; Speech processing; Testing; Wavelet packets; transient component; wavelet packet transform
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2009.4960673
  • Filename
    4960673
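
The recombination step described in the abstract (amplify the transient component, add it to the original speech, then rescale so the energy matches the original) can be sketched as below. This is a minimal illustration only, not the authors' implementation: the function names, the `gain` value, and the toy signals are assumptions, since the abstract gives no amplification factor. The wavelet packet decomposition that would produce the transient component is not shown.

```python
import math


def energy(x):
    """Return the signal energy, i.e. the sum of squared samples."""
    return sum(v * v for v in x)


def recombine(original, transient, gain):
    """Add an amplified transient component to the original signal and
    rescale the result to the energy of the original.

    `gain` is a hypothetical amplification factor (not from the abstract).
    """
    if len(original) != len(transient):
        raise ValueError("signals must have the same length")
    # Selectively amplify the transient part and add it back.
    mixed = [o + gain * t for o, t in zip(original, transient)]
    e_mixed = energy(mixed)
    if e_mixed == 0.0:
        return mixed  # nothing to normalize
    # Scale so that energy(modified) == energy(original).
    scale = math.sqrt(energy(original) / e_mixed)
    return [scale * m for m in mixed]


# Toy values standing in for a speech frame and its transient component.
original = [0.0, 1.0, 0.5, -0.5, -1.0, 0.0]
transient = [0.0, 0.8, 0.0, 0.0, -0.6, 0.0]
modified = recombine(original, transient, gain=3.0)
```

Because of the final rescaling, raising `gain` changes the balance between transient and non-transient content without making the modified signal louder overall.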