• DocumentCode
    1652198
  • Title

    Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency

  • Author

    Yoshii, Kazutomo ; Goto, Misako

  • Author_Institution
    Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
  • fYear
    2013
  • Firstpage
    463
  • Lastpage
    467
  • Abstract
    This paper presents a new probabilistic formulation of linear prediction (LP) for jointly estimating the spectral envelope and fundamental frequency (F0) of a speech signal. A main problem of classical LP is that the peaks of the estimated envelope are highly biased toward the harmonic partials of a speech spectrum. To solve this problem, we propose a nonparametric Bayesian model called infinite kernel linear prediction (IKLP) based on a Gaussian process with multiple kernel learning. Our model can represent the periodicity of a speech signal by using a weighted sum of infinitely many periodic kernels that correspond to different F0s. We put a gamma process prior on the positive weights of those kernels and perform sparse learning to determine a predominant kernel indicating the F0 at the same time of spectral envelope estimation. The experimental results showed that our model can estimate spectral envelopes and F0s of speech and singing signals while identifying pitched segments.
  • Keywords
    Bayes methods; Gaussian processes; learning (artificial intelligence); nonparametric statistics; spectral analysis; speech processing; F0; Gaussian process; IKLP; LP; gamma process; harmonic speech spectrum partials; infinite kernel linear prediction; joint spectral envelope and fundamental frequency estimation; multiple kernel learning; nonparametric Bayesian model; pitched segment identification; predominant kernel; probabilistic formulation; singing signals; sparse learning; speech signal processing; Bayes methods; Estimation; Harmonic analysis; Hidden Markov models; Kernel; Probabilistic logic; Speech; Bayesian nonparametrics; Gaussian and gamma processes; Linear prediction; kernel methods; source-filter model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6637690
  • Filename
    6637690