DocumentCode :
2330338
Title :
F0 Estimation Method for Singing Voice in Polyphonic Audio Signal Based on Statistical Vocal Model and Viterbi Search
Author :
Fujihara, Hiromasa ; Kitahara, Tetsuro ; Goto, Masataka ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.
Author_Institution :
Dept. of Intelligence Sci. & Technol., Kyoto Univ.
Volume :
5
fYear :
2006
fDate :
14-19 May 2006
Abstract :
This paper describes a method for estimating the F0s of the vocal part from polyphonic audio signals. Because the melody is sung by a singer in many musical pieces, estimating the F0s of the vocal part is useful for many applications. Building on an existing multiple-F0 estimation method, we evaluate the vocal probability of the harmonic structure of each F0 candidate. To calculate this probability, we extract and resynthesize the harmonic structure using a sinusoidal model and then extract feature vectors from the resynthesized signal. We then evaluate the vocal probability using vocal and non-vocal Gaussian mixture models (GMMs). Finally, we track F0 trajectories with a Viterbi search based on these probabilities. Experimental results show that our method improves estimation accuracy from 78.1% to 84.3%, a 28.3% relative reduction in misestimation.
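The final tracking step in the abstract can be illustrated with a minimal sketch. It assumes that each frame already has a list of F0 candidates and a precomputed vocal score per candidate (for example, a log-likelihood ratio between the vocal and non-vocal GMMs). The function name `viterbi_f0_track`, the `jump_penalty` parameter, and the octave-distance transition cost are assumptions made for illustration; the abstract does not specify the transition model the authors used.

```python
import math


def viterbi_f0_track(candidates, vocal_logprob, jump_penalty=1.0):
    """Pick one F0 candidate per frame with a Viterbi search.

    candidates[t][i]    : F0 in Hz of candidate i at frame t
    vocal_logprob[t][i] : vocal score of that candidate (assumed precomputed,
                          e.g. vocal-GMM minus non-vocal-GMM log-likelihood)
    jump_penalty        : hypothetical cost per octave of F0 jump between
                          consecutive frames (not taken from the paper)
    """
    n_frames = len(candidates)
    # score[t][i]: best cumulative score of any path ending at candidate i of frame t
    score = [list(vocal_logprob[0])]
    # back[t][i]: index of the best predecessor candidate in frame t - 1
    back = [[-1] * len(candidates[0])]

    for t in range(1, n_frames):
        row_score, row_back = [], []
        for i, f in enumerate(candidates[t]):
            best_j, best = 0, -math.inf
            for j, g in enumerate(candidates[t - 1]):
                # Penalize large pitch jumps, measured in octaves.
                s = score[t - 1][j] - jump_penalty * abs(math.log2(f / g))
                if s > best:
                    best_j, best = j, s
            row_score.append(best + vocal_logprob[t][i])
            row_back.append(best_j)
        score.append(row_score)
        back.append(row_back)

    # Backtrack from the best-scoring candidate of the last frame.
    i = max(range(len(score[-1])), key=score[-1].__getitem__)
    path = []
    for t in range(n_frames - 1, -1, -1):
        path.append(candidates[t][i])
        i = back[t][i]
    return path[::-1]


# Toy input: three frames with two candidates each (made-up numbers).
toy_candidates = [[220.0, 440.0], [220.0, 440.0], [225.0, 440.0]]
toy_scores = [[0.0, -0.5], [-0.2, 0.1], [0.3, -0.1]]
smooth_path = viterbi_f0_track(toy_candidates, toy_scores, jump_penalty=1.0)
greedy_like_path = viterbi_f0_track(toy_candidates, toy_scores, jump_penalty=0.0)
```

In this toy input the middle frame's locally best candidate is 440 Hz, but with a nonzero jump penalty the search keeps the trajectory near 220 Hz because an octave jump there and back costs more than the small gain in vocal score. Setting `jump_penalty` to zero removes that smoothing and the path follows the per-frame scores.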
Keywords :
Gaussian processes; audio signal processing; feature extraction; harmonic analysis; search problems; statistical analysis; F0 estimation method; Gaussian mixture models; Viterbi search; feature vectors extraction; harmonic structure; polyphonic audio signal; singing voice; statistical vocal model; vocal probabilities; Data mining; Educational programs; Feature extraction; Frequency estimation; Informatics; Instruments; Music information retrieval; Probability; Trajectory; Viterbi algorithm;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1661260
Filename :
1661260