DocumentCode :
2985724
Title :
Pitch estimation using harmonic product spectrum derived from DCT
Author :
Sripriya, N. ; Nagarajan, T.
Author_Institution :
Dept. of Inf. Technol., SSN Coll. of Eng., Chennai, India
fYear :
2013
fDate :
22-25 Oct. 2013
Firstpage :
1
Lastpage :
4
Abstract :
Estimation of pitch from a given segment of speech plays an eminent role in various speech processing applications, such as speech coding, speech recognition, speaker recognition tasks, speech synthesis, etc. Even though, there are several efficient algorithms, estimation of pitch frequency from speech signals that are severely degraded by noise is still a challenging task. In this paper, we propose a robust framework for pitch estimation using harmonic product spectrum (HPS) derived from discrete cosine transform (DCT) of the signal. This novel method exploits the better decorrelating nature of the DCT spectrum that enables the pitch harmonics to appear sharper in its spectrum. Potentially, this facilitates accurate pitch estimation at lower order of the harmonic product spectrum when compared with DFT-based HPS. Systematic evaluation is carried out to analyze the performance of the proposed method in comparison with some of the successful algorithms, like DFT-based HPS, SIFT, and Cepstrum-based technique. The results clearly show that the proposed algorithm outperforms the other algorithms for speech signals that are severely corrupted by noise (low SNR). The effectiveness of this method for different durations of analysis window, various orders of HPS, and the refinements are also discussed.
Keywords :
discrete cosine transforms; speech processing; DCT; HPS; cepstrum based technique; discrete cosine transform; harmonic product spectrum; pitch estimation; speaker recognition; speech coding; speech processing applications; speech recognition; speech segmentation; speech signals; speech synthesis; Accuracy; Discrete Fourier transforms; Discrete cosine transforms; Estimation; Harmonic analysis; Noise; Speech; DCT; Harmonic product spectrum; Pitch;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON 2013 - 2013 IEEE Region 10 Conference (31194)
Conference_Location :
Xi´an
ISSN :
2159-3442
Print_ISBN :
978-1-4799-2825-5
Type :
conf
DOI :
10.1109/TENCON.2013.6718976
Filename :
6718976
Link To Document :
بازگشت