DocumentCode :
177637
Title :
PYIN: A fundamental frequency estimator using probabilistic threshold distributions
Author :
Mauch, Matthias ; Dixon, Sam
Author_Institution :
Centre for Digital Music, Queen Mary Univ. of London, London, UK
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
659
Lastpage :
663
Abstract :
We propose the Probabilistic YIN (PYIN) algorithm, a modification of the well-known YIN algorithm for fundamental frequency (F0) estimation. Conventional YIN is a simple yet effective method for frame-wise monophonic F0 estimation and remains one of the most popular methods in this domain. In order to eliminate short-term errors, outputs of frequency estimators are usually post-processed resulting in a smoother pitch track. One shortcoming of YIN is that such post-processing cannot fall back on alternative interpretations of the signal because the method outputs precisely one estimate per frame. To address this problem we modify YIN to output multiple pitch candidates with associated probabilities (PYIN Stage 1). These probabilities arise naturally from a prior distribution on the YIN threshold parameter. We use these probabilities as observations in a hidden Markov model, which is Viterbi-decoded to produce an improved pitch track (PYIN Stage 2). We demonstrate that the combination of Stages 1 and 2 raises recall and precision substantially. The additional computational complexity of PYIN over YIN is low. We make the method freely available online1 as an open source C++ library for Vamp hosts.
Keywords :
Viterbi decoding; computational complexity; frequency estimation; hidden Markov models; speech processing; statistical distributions; PYIN algorithm; Vamp hosts; Viterbi; YIN threshold parameter; computational complexity; framewise monophonic F0 estimation; fundamental frequency estimation; fundamental frequency estimators; hidden Markov model; open source C++ library; pitch track; post-processing; probabilistic YIN algorithm; probabilistic threshold distributions; short-term errors; Algorithm design and analysis; Databases; Frequency estimation; Hidden Markov models; Probabilistic logic; Signal processing algorithms; Smoothing methods; Pitch estimation; YIN; pitch tracking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6853678
Filename :
6853678
Link To Document :
بازگشت