DocumentCode :
2705007
Title :
Pitch Estimation using Models of Voiced Speech on Three Levels
Author :
Joho, D. ; Bennewitz, Maren ; Behnke, Sven
Author_Institution :
Dept. of Comput. Sci., Freiburg Univ., Germany
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
We present an algorithm for estimating the fundamental frequency in speech signals. Our approach incorporates models of voiced speech on three levels. First, we estimate the pitch for each time frame based on its harmonic structure using non-negative matrix factorization. The second level utilizes temporal pitch continuity to extract partial pitch contours. Thirdly, we incorporate statistics of the succession of voiced segments to aggregate partial contours to the final contour of an utterance. We evaluate our approach on the Keele database. The experimental results show the robustness of our method for noisy speech, and the good performance for clean speech in comparison with state-of-the-art algorithms.
Keywords :
matrix decomposition; speech processing; Keele database; harmonic structure; noisy speech; nonnegative matrix factorization; partial pitch contours; pitch estimation; speech signals; temporal pitch continuity; voiced segments; voiced speech; Aggregates; Computer science; Frequency estimation; Hidden Markov models; Matrix decomposition; Robustness; Speech analysis; Speech enhancement; Statistical learning; Statistics; matrix decomposition; pitch estimation; speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367260
Filename :
4218291
Link To Document :
بازگشت