DocumentCode
1652198
Title
Infinite kernel linear prediction for joint estimation of spectral envelope and fundamental frequency
Author
Yoshii, Kazutomo ; Goto, Misako
Author_Institution
Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
fYear
2013
Firstpage
463
Lastpage
467
Abstract
This paper presents a new probabilistic formulation of linear prediction (LP) for jointly estimating the spectral envelope and fundamental frequency (F0) of a speech signal. A main problem of classical LP is that the peaks of the estimated envelope are highly biased toward the harmonic partials of a speech spectrum. To solve this problem, we propose a nonparametric Bayesian model called infinite kernel linear prediction (IKLP) based on a Gaussian process with multiple kernel learning. Our model can represent the periodicity of a speech signal by using a weighted sum of infinitely many periodic kernels that correspond to different F0s. We put a gamma process prior on the positive weights of those kernels and perform sparse learning to determine a predominant kernel indicating the F0 at the same time of spectral envelope estimation. The experimental results showed that our model can estimate spectral envelopes and F0s of speech and singing signals while identifying pitched segments.
Keywords
Bayes methods; Gaussian processes; learning (artificial intelligence); nonparametric statistics; spectral analysis; speech processing; F0; Gaussian process; IKLP; LP; gamma process; harmonic speech spectrum partials; infinite kernel linear prediction; joint spectral envelope and fundamental frequency estimation; multiple kernel learning; nonparametric Bayesian model; pitched segment identification; predominant kernel; probabilistic formulation; singing signals; sparse learning; speech signal processing; Bayes methods; Estimation; Harmonic analysis; Hidden Markov models; Kernel; Probabilistic logic; Speech; Bayesian nonparametrics; Gaussian and gamma processes; Linear prediction; kernel methods; source-filter model;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location
Vancouver, BC
ISSN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2013.6637690
Filename
6637690
Link To Document