• DocumentCode
    2220262
  • Title
    A novel voiced speech enhancement approach based on modulated periodic signal extraction
  • Author
    Triki, Mahdi; Slock, Dirk T. M.
  • Author_Institution
    Commun. Syst. Lab., Sophia Antipolis, France
  • fYear
    2006
  • fDate
    4-8 Sept. 2006
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Most existing speech coding and speech enhancement techniques are based on the AR model and hence apply well to unvoiced speech. These same techniques are then applied to the voiced case as well by extrapolation. However, voiced speech is highly structured, so a proper approach allows one to go further than for unvoiced speech. We model a voiced speech segment as a periodic signal with (slow) global variation of amplitude and frequency (limited time warping). The bandlimited variation of global amplitude and frequency is expressed through a subsampled representation and parameterization of the corresponding signals. Assuming additive white Gaussian noise, a Maximum Likelihood approach is proposed for the estimation of the model parameters, and the optimization is performed in an iterative (cyclic) fashion that leads to a sequence of simple least-squares problems. Particular attention is paid to the estimation of the basic periodic signal, which can have a non-integer period, and to the estimation of the amplitude signal with guaranteed positivity.
  • Keywords
    AWGN; extrapolation; iterative methods; maximum likelihood estimation; speech coding; speech enhancement; AR model; additive white Gaussian noise; amplitude signal variation; bandlimited variation; frequency variation; iterative fashion; least-squares problems; maximum likelihood approach; model parameter estimation; modulated periodic signal extraction; noninteger period; optimization; signal parameterization; signal representation; unvoiced speech; voiced speech enhancement approach; voiced speech segment; Abstracts; Gold; Hidden Markov models; Iron; Modulation; Signal to noise ratio; Speech
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference, 2006 14th European
  • Conference_Location
    Florence
  • ISSN
    2219-5491
  • Type
    conf
  • Filename
    7071411
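
The cyclic least-squares idea described in the abstract can be illustrated with a much simplified sketch. It is not the authors' algorithm: it assumes an integer period, ignores the frequency (time-warping) variation, does not enforce amplitude positivity, and represents the slow amplitude by linear interpolation between a few subsampled knots. The names `interp_matrix`, `fit_modulated_periodic`, `n_knots`, and `n_iter` are hypothetical. Under white Gaussian noise, maximizing the likelihood is equivalent to minimizing squared error, and the model `y[n] = a[n] * s[n mod period]` is linear in `s` when `a` is fixed and linear in the knots of `a` when `s` is fixed, so each half-step is an ordinary least-squares problem.

```python
import numpy as np

def interp_matrix(n_samples, n_knots):
    """Matrix B such that B @ knots is the linearly interpolated amplitude."""
    pos = np.linspace(0, n_knots - 1, n_samples)
    left = np.minimum(pos.astype(int), n_knots - 2)
    frac = pos - left
    B = np.zeros((n_samples, n_knots))
    rows = np.arange(n_samples)
    B[rows, left] = 1.0 - frac
    B[rows, left + 1] = frac
    return B

def fit_modulated_periodic(y, period, n_knots, n_iter=20):
    """Alternate two least-squares fits for y[n] ~ a[n] * s[n % period].

    Simplifying assumptions (not from the paper): integer period,
    no time warping, no positivity constraint, n_knots >= 2.
    """
    y = np.asarray(y, dtype=float)
    n = len(y)
    phase = np.arange(n) % period
    B = interp_matrix(n, n_knots)
    a = np.ones(n)  # start from a flat amplitude
    for _ in range(n_iter):
        # Amplitude fixed: each template sample has a closed-form LS solution.
        num = np.bincount(phase, weights=a * y, minlength=period)
        den = np.bincount(phase, weights=a * a, minlength=period)
        s = num / np.maximum(den, 1e-12)
        # Remove the scale ambiguity between a and s (unit-RMS template).
        s /= np.linalg.norm(s) / np.sqrt(period)
        # Template fixed: amplitude knots solve a linear LS problem.
        X = B * s[phase][:, None]
        knots, *_ = np.linalg.lstsq(X, y, rcond=None)
        a = B @ knots
    return s, a
```

Each half-step cannot increase the squared error, so the iteration is monotone, although in general it is only guaranteed to reach a local optimum.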