• DocumentCode
    239456
  • Title

    An alternating ℓp — ℓ2 projections algorithm (ALPA) for speech modeling using sparsity constraints

  • Author

    Adiga, Aniruddha ; Seelamantula, Chandra Sekhar

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Sci. Bangalore, Bangalore, India
  • fYear
    2014
  • fDate
    20-23 Aug. 2014
  • Firstpage
    291
  • Lastpage
    296
  • Abstract
    We address the problem of separating a speech signal into its excitation and vocal-tract filter components, which falls within the framework of blind deconvolution. Typically, the excitation in case of voiced speech is assumed to be sparse and the vocal-tract filter stable. We develop an alternating ℓp - ℓ2 projections algorithm (ALPA) to perform deconvolution taking into account these constraints. The algorithm is iterative, and alternates between two solution spaces. The initialization is based on the standard linear prediction decomposition of a speech signal into an autoregressive filter and prediction residue. In every iteration, a sparse excitation is estimated by optimizing an ℓp-norm-based cost and the vocal-tract filter is derived as a solution to a standard least-squares minimization problem. We validate the algorithm on voiced segments of natural speech signals and show applications to epoch estimation. We also present comparisons with state-of-the-art techniques and show that ALPA gives a sparser impulse-like excitation, where the impulses directly denote the epochs or instants of significant excitation.
  • Keywords
    deconvolution; filtering theory; iterative methods; least squares approximations; minimisation; regression analysis; speech processing; ℓp-norm-based cost; ALPA; alternating ℓp-ℓ2 projections algorithm; autoregressive filter; blind deconvolution; epoch estimation; excitation components; impulse-like excitation; iterative algorithm; natural speech signals; prediction residue; solution spaces; speech modeling; standard least-squares minimization problem; standard linear prediction decomposition; vocal-tract filter components; voiced segments; voiced speech; Deconvolution; Digital signal processing; Estimation; Signal processing algorithms; Signal to noise ratio; Speech; Standards; Deconvolution; Iteratively reweighted least-squares (IRLS) technique; Linear prediction; Sparsity constraints;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing (DSP), 2014 19th International Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/ICDSP.2014.6900673
  • Filename
    6900673