DocumentCode :
542222
Title :
Spline-based continuous-time pitch estimation
Author :
Jefremov, Andrei ; Kleijn, W. Bastiaan
Author_Institution :
Department of Speech, Music and Hearing, KTH (Royal Institute of Technology), 10044 Stockholm, Sweden
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
Pitch-synchronous speech coding algorithms can achieve low bit rates without compromising quality. However, the effectiveness of pitch-synchronous coding depends strongly on the ability to estimate the fundamental period of the speech signal precisely and reliably. We present a novel pitch postprocessing method that significantly improves the accuracy and reliability of pitch estimation. In contrast to classical schemes, the pitch is treated as a function that is continuous in both time and amplitude. B-spline signal processing, half-wave rectification, and multi-stage, multi-resolution optimization are essential parts of the procedure. The performance of the method is evaluated objectively and subjectively using the Waveform Interpolation coder. The objective results show that, for voiced segments, the method significantly decreases (by 60% on average) the energy of the unvoiced component estimate compared with using an unprocessed pitch. Listening tests show a 90% preference for speech generated using our postprocessor over speech generated using a conventional method.
Keywords :
Auditory system; Optimization; Speech; Spline;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743723
Filename :
5743723