• DocumentCode
    471599
  • Title
    Identification of Speech Transients Using Variable Frame Rate Analysis and Wavelet Packets
  • Author
    Rasetshwane, Daniel M. ; Boston, J. Robert ; Li, Ching-Chung
  • Author_Institution
    Dept. of Electr. & Comput. Eng., Pittsburgh Univ., PA
  • fYear
    2006
  • fDate
    Aug. 30 2006-Sept. 3 2006
  • Firstpage
    1727
  • Lastpage
    1730
  • Abstract
    Speech transients are important cues for identifying and discriminating speech sounds. Yoo et al. and Tantibundhit et al. successfully identified speech transients and, by emphasizing them, improved the intelligibility of speech in noise. However, their methods are computationally intensive and unsuitable for real-time applications. This paper presents a method to identify and emphasize speech transients that combines subband decomposition by the wavelet packet transform with variable frame rate (VFR) analysis and unvoiced consonant detection. The VFR analysis is applied to each wavelet packet to define a transitivity function that describes the extent to which the wavelet coefficients of that packet are changing. Unvoiced consonant detection is used to identify unvoiced consonant intervals, and the transitivity function is amplified during these intervals. The wavelet coefficients are multiplied by the transitivity function for that packet, which amplifies coefficients at times when they are changing and attenuates them at times when they are steady. The inverse transform of the modified wavelet packet coefficients produces a signal corresponding to speech transients, similar to the transients identified by Yoo et al. and Tantibundhit et al. A preliminary implementation of the algorithm runs more efficiently
  • Keywords
    speech intelligibility; speech processing; wavelet transforms; attenuation coefficients; inverse transform; speech sound discrimination; speech transient identification; subband decomposition; transitivity function; unvoiced consonant detection; variable frame rate analysis; wavelet coefficients; wavelet packet transform; Frequency; Hidden Markov models; Signal processing; Speech analysis; Speech enhancement; Speech recognition; Transient analysis; Wavelet analysis; Wavelet packets
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Engineering in Medicine and Biology Society, 2006. EMBS '06. 28th Annual International Conference of the IEEE
  • Conference_Location
    New York, NY
  • ISSN
    1557-170X
  • Print_ISBN
    1-4244-0032-5
  • Electronic_ISBN
    1557-170X
  • Type
    conf
  • DOI
    10.1109/IEMBS.2006.260720
  • Filename
    4462106
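
The processing chain described in the abstract (wavelet packet decomposition, a per-packet transitivity function, coefficient weighting, inverse transform) can be illustrated with a minimal sketch. This is not the authors' implementation. Everything specific here is an assumption made for illustration: the Haar filter, the function names, the `levels` and `frame` parameters, and in particular the change measure, which is the frame-to-frame difference in log energy rather than the VFR measure used in the paper. Unvoiced consonant detection is omitted.

```python
import numpy as np


def haar_wp_decompose(x, levels):
    """Full Haar wavelet packet tree; returns 2**levels coefficient arrays.

    Assumes len(x) is divisible by 2**levels.
    """
    packets = [np.asarray(x, dtype=float)]
    for _ in range(levels):
        nxt = []
        for p in packets:
            even, odd = p[0::2], p[1::2]
            nxt.append((even + odd) / np.sqrt(2.0))  # low-pass half
            nxt.append((even - odd) / np.sqrt(2.0))  # high-pass half
        packets = nxt
    return packets


def haar_wp_reconstruct(packets):
    """Invert haar_wp_decompose by merging sibling packets pairwise."""
    packets = list(packets)
    while len(packets) > 1:
        nxt = []
        for a, d in zip(packets[0::2], packets[1::2]):
            p = np.empty(2 * len(a))
            p[0::2] = (a + d) / np.sqrt(2.0)
            p[1::2] = (a - d) / np.sqrt(2.0)
            nxt.append(p)
        packets = nxt
    return packets[0]


def transitivity(coeffs, frame=4):
    """Hypothetical transitivity function with values in [0, 1].

    Stand-in change measure (an assumption, not the paper's VFR measure):
    absolute difference in log energy between adjacent frames, scaled by
    its maximum and held constant within each frame.
    Assumes len(coeffs) >= frame.
    """
    n = len(coeffs)
    n_frames = n // frame
    frames = coeffs[: n_frames * frame].reshape(n_frames, frame)
    energy = np.log(np.sum(frames ** 2, axis=1) + 1e-12)
    change = np.abs(np.diff(energy, prepend=energy[0]))
    peak = change.max()
    t = change / peak if peak > 0 else np.zeros_like(change)
    t = np.repeat(t, frame)
    # Extend the last value over any leftover samples.
    return np.pad(t, (0, n - len(t)), mode="edge")


def emphasize_transients(x, levels=3, frame=4):
    """Weight each packet by its transitivity function, then invert."""
    packets = haar_wp_decompose(x, levels)
    weighted = [p * transitivity(p, frame) for p in packets]
    return haar_wp_reconstruct(weighted)
```

For a step input such as `np.concatenate([np.zeros(256), np.ones(256)])`, this sketch keeps only the samples just after the onset and suppresses the steady portions on either side, which is the qualitative behavior the abstract describes. A practical version would likely use a smoother wavelet and the paper's own change measure.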