Speech enhancement based on joint time-frequency segmentation

Author

Tantibundhit, C. ; Pernkopf, F. ; Kubin, G.

Author_Institution

MedIntelligence & Innovation Lab., Thammasat Univ., Bangkok

fYear

2009

fDate

19-24 April 2009

Firstpage

4673

Lastpage

4676

Abstract

We present an algorithm to decompose speech into transient and non-transient components. Our algorithm, the joint time-frequency segmentation algorithm, uses the wavelet packet coefficients of the speech signal and represents them as tiles of a time-frequency representation adapted to the characteristics of the signal itself. Any wavelet packet coefficient, whose tiling height is larger than or equal to the tiling width is characterized as a transient coefficient and vice versa for the non-transient coefficient. The transient component is selectively amplified and recombined with the original speech to generate the modified speech with energy adjusted to be equal to the energy of the original speech. The psychoacoustic tests performed with fourteen human listeners show that the speech modification significantly improves speech intelligibility in background noise, i.e., for 10% absolute at 0d B to 31% absolute at -30 dB.

Keywords

signal representation; speech enhancement; speech intelligibility; time-frequency analysis; wavelet transforms; background noise; human listeners; joint time-frequency segmentation; non-transient coefficient; psychoacoustic tests; speech enhancement; speech intelligibility; speech modification; speech signal; time-frequency representation; wavelet packet coefficients; Background noise; Band pass filters; Noise cancellation; Psychology; Signal processing algorithms; Speech enhancement; Speech processing; Testing; Time frequency analysis; Wavelet packets; Speech enhancement; speech intelligibility; transient component; wavelet packet transform;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location

Taipei

ISSN

1520-6149

Print_ISBN

978-1-4244-2353-8

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2009.4960673

Filename

4960673