Title :
Joint Time–Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
Author :
Tantibundhit, C. ; Pernkopf, F. ; Kubin, G.
Author_Institution :
Dept. of Electr. & Comput. Eng., Thammasat Univ., Pathumthani, Thailand
Abstract :
We develop an algorithm, the joint time-frequency segmentation algorithm, where the wavelet packet coefficients of the analyzed speech signal are represented as tiles of a time-frequency representation adapted to the characteristics of the signal itself. Further, our algorithm enables the decomposition of the speech signal into transient and non-transient components, respectively. Any block of wavelet packet coefficients, whose tiling height is larger than or equal to the tiling width belongs to the transient component and vice versa for the non-transient component. The transient component is selectively amplified and recombined with the original speech to generate the modified speech with energy adjusted to be equal to the original speech. The intelligibility of the original and modified speech is evaluated by 16 human listeners. Word recognition rate results show that the modified speech significantly improves speech intelligibility in background noise, i.e., by 10% absolute at 0 dB to 27% absolute at -30 dB.
Keywords :
speech enhancement; wavelet transforms; word processing; joint time-frequency segmentation algorithm; speech enhancement; speech signal; transient speech decomposition; wavelet packet coefficients; word recognition rate; Algorithm design and analysis; Signal analysis; Speech analysis; Speech enhancement; Speech recognition; Time frequency analysis; Wavelet analysis; Wavelet packets; Speech enhancement; joint time–frequency (TF) segmentation; speech intelligibility; transient component; wavelet packet transform;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2009.2035037