Title :
Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend
Author :
Chu, Wei ; Alwan, Abeer
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, CA
Abstract :
In this paper, we propose an F0 Frame Error (FFE) metric which combines Gross Pitch Error (GPE) and Voicing Decision Error (VDE) to objectively evaluate the performance of fundamental frequency (F0) tracking methods. A GPE-VDE curve is then developed to show the trade-off between GPE and VDE. In addition, we introduce a model-based Unvoiced/Voiced (U/V) classification frontend which can be used by any F0 tracking algorithm. In the U/V classification, we train speaker independent U/V models, and then adapt them to speaker dependent models in an unsupervised fashion. The U/V classification result is taken as a mask for F0 tracking. Experiments using the KEELE corpus with additive noise show that our statistically-based U/V classifier can reduce VDE and FFE for the pitch tracker TEMPO in both white and babble noise conditions, and that minimizing FFE instead of VDE results in a reduction in error rates for a number of F0 tracking algorithms, especially in babble noise.
Keywords :
signal classification; speaker recognition; frame error metric; fundamental frequency tracking; gross pitch error; noisy conditions; pitch tracker; speaker dependent model; tracking algorithm; unvoiced/voiced classification frontend; voicing decision error; Additive noise; Error analysis; Frequency; Noise measurement; Noise reduction; Speech analysis; Speech coding; Speech processing; Speech synthesis; Working environment noise; Evaluation Metrics; Fundamental Frequency; Noise Robustness; Pitch Tracking; Unvoiced/Voiced Classification;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960497