DocumentCode :
47256
Title :
BaNa: A Noise Resilient Fundamental Frequency Detection Algorithm for Speech and Music
Author :
Na Yang ; He Ba ; Weiyang Cai ; Demirkol, Ilker ; Heinzelman, Wendi
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Rochester, Rochester, NY, USA
Volume :
22
Issue :
12
fYear :
2014
fDate :
Dec. 2014
Firstpage :
1833
Lastpage :
1848
Abstract :
Fundamental frequency (F0) is one of the essential features in many acoustic related applications. Although numerous F0 detection algorithms have been developed, the detection accuracy in noisy environments still needs improvement. We present a hybrid noise resilient F0 detection algorithm named BaNa that combines the approaches of harmonic ratios and Cepstrum analysis. A Viterbi algorithm with a cost function is used to identify the F0 value among several F0 candidates. Speech and music databases with eight different types of additive noise are used to evaluate the performance of the BaNa algorithm and several classic and state-of-the-art F0 detection algorithms. Results show that for almost all types of noise and signal-to-noise ratio (SNR) values investigated, BaNa achieves the lowest Gross Pitch Error (GPE) rate among all the algorithms. Moreover, for the 0 dB SNR scenarios, the BaNa algorithm is shown to achieve 20% to 35% GPE rate for speech and 12% to 39% GPE rate for music. We also describe implementation issues that must be addressed to run the BaNa algorithm as a real-time application on a smartphone platform.
Keywords :
cepstral analysis; music; signal detection; speech processing; BaNa algorithm; GPE rate; SNR; Viterbi algorithm; additive noise; cepstrum analysis; cost function; gross pitch error rate; harmonic ratios; hybrid noise resilient F0 detection algorithm; music databases; noise and signal-to-noise ratio; noise resilient fundamental frequency detection algorithm; performance evaluation; smartphone platform; speech databases; Detection algorithms; Harmonic analysis; Noise; Noise measurement; Power harmonic filters; Speech; Speech processing; Cepstrum; fundamental frequency detection; harmonics; noise resilience; viterbi algorithm;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2014.2352453
Filename :
6884780
Link To Document :
بازگشت