• DocumentCode
    3445627
  • Title

    Voiced/unvoiced speech discrimination in noise using Gabor atomic decomposition

  • Author

    Lobo, Arthur P. ; Loizou, Philipos C.

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Texas at Dallas, Richardson, TX, USA
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    A new algorithm is developed for voiced-unvoiced speech discrimination in noise. Short segments of speech are modeled as a sum of basis functions from a Gabor dictionary. In each iteration, a Gabor atom is fitted (using the matching pursuit algorithm) to the residual obtained by subtracting the best-fit Gabor atom from the previous residual. Multiple discriminant analysis is used to reduce the dimensionality of the vector of Gabor coefficients to give a low-dimensional feature vector for classification. A radial basis function neural network is trained on the reduced feature vector set to discriminate between voiced and unvoiced speech/silence segments. On a database of 62 sentences in 5-dB SNR speech-shaped noise, 84% correct classification accuracy was obtained.
  • Keywords
    feature extraction; learning (artificial intelligence); noise; radial basis function networks; signal classification; signal representation; speech processing; Gabor atomic decomposition; Gabor coefficients vector; Gabor dictionary; Gabor representation; SNR; best-fit Gabor atom; classification accuracy; low-dimensional feature vector; matching pursuit algorithm; multiple discriminant analysis; radial basis function neural network; sentences database; speech classification; speech-shaped noise; training; unvoiced speech/silence segments; voiced speech segments; voiced/unvoiced speech discrimination; Atomic measurements; Dictionaries; Matching pursuit algorithms; Pursuit algorithms; Radial basis function networks; Signal to noise ratio; Spatial databases; Speech analysis; Speech enhancement; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198907
  • Filename
    1198907