DocumentCode
3527752
Title
Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend
Author
Chu, Wei ; Alwan, Abeer
Author_Institution
Dept. of Electr. Eng., Univ. of California, Los Angeles, CA
fYear
2009
fDate
19-24 April 2009
Firstpage
3969
Lastpage
3972
Abstract
In this paper, we propose an F0 Frame Error (FFE) metric which combines Gross Pitch Error (GPE) and Voicing Decision Error (VDE) to objectively evaluate the performance of fundamental frequency (F0) tracking methods. A GPE-VDE curve is then developed to show the trade-off between GPE and VDE. In addition, we introduce a model-based Unvoiced/Voiced (U/V) classification frontend which can be used by any F0 tracking algorithm. In the U/V classification, we train speaker independent U/V models, and then adapt them to speaker dependent models in an unsupervised fashion. The U/V classification result is taken as a mask for F0 tracking. Experiments using the KEELE corpus with additive noise show that our statistically-based U/V classifier can reduce VDE and FFE for the pitch tracker TEMPO in both white and babble noise conditions, and that minimizing FFE instead of VDE results in a reduction in error rates for a number of F0 tracking algorithms, especially in babble noise.
Keywords
signal classification; speaker recognition; frame error metric; fundamental frequency tracking; gross pitch error; noisy conditions; pitch tracker; speaker dependent model; tracking algorithm; unvoiced/voiced classification frontend; voicing decision error; Additive noise; Error analysis; Frequency; Noise measurement; Noise reduction; Speech analysis; Speech coding; Speech processing; Speech synthesis; Working environment noise; Evaluation Metrics; Fundamental Frequency; Noise Robustness; Pitch Tracking; Unvoiced/Voiced Classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960497
Filename
4960497
Link To Document