DocumentCode
2852680
Title
Yet Another Algorithm for Pitch Tracking
Author
Kasi, Kavita ; Zahorian, Stephen A.
Author_Institution
Department of Electrical and Computer Engineering, Old Dominion University, Norfolk, VA 23529, USA
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
In this paper, we present a pitch detection algorithm that is extremely robust for both high quality and telephone speech. The kernel method for this algorithm is the “NCCF or Normalized Cross Correlation” reported by David Talkin [1]. Major innovations include: processing of the original acoustic signal and a nonlinearly processed version of the signal to partially restore very weak F0 components; intelligent peak picking to select multiple F0 candidates and assign merit factors; and, incorporation of highly rohust pitch contours obtained from smoothed versions of low frequency portions of spectrograms. Dynamic programming is used to find the “best” pitch track among all the candidates, using both local and transition costs. We evaluated our algorithm using the Keele pitch extraction reference database as “ground truth” for both “high quality” and “telephone” speech. For both types of speech, the error rates obtained are lower than the lowest reported in the literature.
Keywords
Feature extraction; Frequency measurement; Indexes; Inspection; MATLAB; Speech; Variable speed drives;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743729
Filename
5743729
Link To Document