DocumentCode :
1652198
Title :
Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency
Author :
Yoshii, Kazutomo ; Goto, Misako
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
fYear :
2013
Firstpage :
463
Lastpage :
467
Abstract :
This paper presents a new probabilistic formulation of linear prediction (LP) for jointly estimating the spectral envelope and fundamental frequency (F0) of a speech signal. A main problem of classical LP is that the peaks of the estimated envelope are highly biased toward the harmonic partials of a speech spectrum. To solve this problem, we propose a nonparametric Bayesian model called infinite kernel linear prediction (IKLP) based on a Gaussian process with multiple kernel learning. Our model can represent the periodicity of a speech signal by using a weighted sum of infinitely many periodic kernels that correspond to different F0s. We put a gamma process prior on the positive weights of those kernels and perform sparse learning to determine a predominant kernel indicating the F0 at the same time of spectral envelope estimation. The experimental results showed that our model can estimate spectral envelopes and F0s of speech and singing signals while identifying pitched segments.
Keywords :
Bayes methods; Gaussian processes; learning (artificial intelligence); nonparametric statistics; spectral analysis; speech processing; F0; Gaussian process; IKLP; LP; gamma process; harmonic speech spectrum partials; infinite kernel linear prediction; joint spectral envelope and fundamental frequency estimation; multiple kernel learning; nonparametric Bayesian model; pitched segment identification; predominant kernel; probabilistic formulation; singing signals; sparse learning; speech signal processing; Bayes methods; Estimation; Harmonic analysis; Hidden Markov models; Kernel; Probabilistic logic; Speech; Bayesian nonparametrics; Gaussian and gamma processes; Linear prediction; kernel methods; source-filter model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6637690
Filename :
6637690
Link To Document :
بازگشت