• DocumentCode
    3164337
  • Title

    Exploiting the harmonic structure for speech enhancement

  • Author

    Cho, Eunjoon ; Smith, Julius O., III ; Widrow, Bernard

  • Author_Institution
    Center for Comput. Res. in Music & Acoust., Stanford Univ., Stanford, CA, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4569
  • Lastpage
    4572
  • Abstract
    We provide a single channel speech enhancement method leveraging the harmonic structure of voiced speech. A sinusoidal model, based on the pitch of the speaker, is used to filter noisy speech and remove any noise components that lie between the harmonics. To remove noise that lie on each harmonic frequency, we use a noise estimation procedure that exploits spectral sparsity of voiced speech. By measuring the power spectrum at frequencies that correspond to the zero crossings of the windowing function, we can estimate the noise levels even in frames that have voiced speech. We also provide a constrained linear least squares formulation to reduce “musical noise” which arises from difficulty in estimating speech and noise power spectral densities. We show that our method yields high perceptual performance over existing methods, and can easily adapt to conditions in which the noise characteristics are constantly changing.
  • Keywords
    estimation theory; least squares approximations; speech enhancement; constrained linear least squares formulation; harmonic frequency; harmonic structure; musical noise; noise characteristics; noise components; noise estimation procedure; noise levels; noise power spectral densities; noisy speech; power spectrum; single channel speech enhancement method; sinusoidal model; speaker pitch; spectral sparsity; voiced speech; windowing function; zero crossings; Estimation; Harmonic analysis; Noise; Noise measurement; Power harmonic filters; Speech; Speech enhancement; Harmonic filter; Noise estimation; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288935
  • Filename
    6288935